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This article comprises a review of both the quasi-probability representations of infinite- 
dimensional quantum theory (including the Wigner function) and the more recently 
defined quasi-probability representations of finite-dimensional quantum theory. We focus 
on both the characteristics and applications of these representations with an emphasis 
toward quantum information theory. We discuss the recently proposed unification of the 
set of possible quasi-probability representations via frame theory and then discuss the 
practical relevance of negativity in such representations as a criteria for quantumness. 
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I. INTRODUCTION 

The quasi-probability representations of quantum the- 
ory evolved from the phase space representations. Phase 
space is a natural concept in classical theory since it is 
equivalent to the state space. The idea of formulating 
quantum theory in phase space dates back to the early 
days of quantum theory when the Wigner function was 
introduced (Wigner, 1932 1. The term quasi-probability 



refers to the fact that the function is not a true prob- 
ability density as it takes on negative values for some 
quantum states. 

The Wigner phase space formulation of quantum the- 
ory is equivalent to the usual abstract formalism of quan- 
tum theory in the same sense that Heisenberg's matrix 
mechanics and Schrodinger's wave mechanics are equiv- 
alent to the abstract formalism. The Wigner function 
representation is not the only quasi-probabilistic formu- 
lation of quantum theory. However, in most, if not all, 
of the familiar quasi-probability representations the kine- 
matic or ontic space of the representation is presumed to 
be the usual canonical phase space of classical physics. 
In the broader context of attempting to represent quan- 
tum mechanics as a classical probability theory, the clas- 
sical state space need not necessarily correspond to the 
phase space of some classical canonical variables. But 
even these more general approaches are not applicable to 
describing quantum systems with a finite set of distin- 
guishable states. 



Recently, analogues of the Wigner function for finite- 
dimensional quantum systems have been proposed. 
There are many adequate reviews of infinite dimensional 
phase representations Q which we complement by review- 
ing the recently defined quasi-probability representations 
of finite-dimensional quantum states. Moreover, we re- 



view a unifying formalism proposed in references (Fer- 



rie and Emerson 2008 2009 Ferrie et al. 20101 which 



encompasses all previously constructed quasi-probability 
representations and provides a means of defining the class 
of all possible quasi-probability representations. 

While entanglement is the most notable criteria for 
quantumness, it can only be defined for composite sys- 
tems. A non-negative quasi-probability function is a true 
probability distribution, prompting some authors to sug- 
gest that the presence of negativity in this function is 
a defining signature of non-classicality for a single sys- 
tem. Thus, a central concept in studies of the quantum- 
classical contrast in the quasi-probability formalisms of 
quantum theory is the appearance of negativity. How- 
ever, it should be understood that any particular rep- 
resentation is non-unique and to some extent arbitrary 
since a state with negativity in one representation is pos- 
itive in another. The concept of entanglement arose from 
the study of nonlocality. This, and contextuality, are per- 
haps the most famous notions of quantumness. We will 
discuss the relationship between these and negativity in 
the final section. Nonlocality and contextuality arc terms 
often used as shorthand for the phrases "there does not 
exist a local hidden variable model for quantum theory" 
and "there does not exist a non- contextual hidden vari- 
able model for quantum theory" . Negativity is a similar 
concept. Moreover, since the assumptions which go into 
the meta-concept of hidden variable model are common 
to each notion, there is a sense in which it is convenient 
to call them "equivalent" ( Spekkens 2008 1 . 



The outline of the paper is as follows. In section 
|TT| we see how quasi-probability arises quite naturally 
when trying to apply standard probabilistic constraints 



on quantum theory. In section III the quasi-probability 
representations on infinite dimensional Hilbert space are 
reviewed. Those defined for finite dimensional Hilbert 
spaces are review in section |IV| Both are shown to be 
part of a unified formalism utilizing frames in section [V] 
Negativity as a criteria for quantumness and its relation 
to contextuality and nonlocality are discussed in section 
VII The final section summarizes what we have learned. 



II. FROM PROBABILITY TO QUASI-PROBABILITY 

A. Operational theories, ontological models and 
non-contextuality 



An operational theory (Hardy 2001a Harrigan and 



Rudolph 2007 Spekkens 2005) is an attempt to math- 
ematically model a real physical experiment. The con- 
cepts in the theory are preparations, systems, measure- 
ments and outcomes. A preparation P is a proposition 
specifying how a real physical system has come to be 
the object of experimental investigation. We will reason 
about a set of mutually exclusive preparations V . The 
system is assumed to then be measured according to some 
measurement procedure which produces an unambiguous 
answer called the outcome. The measurement procedure 
is specified by a proposition M belonging to a set of mu- 
tually exclusive possibilities Ai . The outcome is specified 
by a proposition k which is assumed to belong to a set of 
mutually exclusive and exhaustive possibilities K. This 
means that, in any given run of the experiment, exactly 
one of K is true. 

For example, P could be the setting of the knob on a 
"black box" which sends objects to another "black box" 
with knob setting M which outputs some sensory cue 
(e.g. an audible "click" or flash of light) labeled k as 
each system passes through. Note that P and M need 



not be statements about devices in a laboratory (Schack 



Hillery et al. 


1984 


Lee 


1995 



2003); P could be a statement about a photon arriving 
from the sun, for example. 

For a fixed preparation and measurement the outcome 
may not be deterministic. The role of the theory is then 
to describe the probability of each each outcome condi- 
tional on the the various combinations of preparations 
and measurements. That is, the theory should tabulate 
the numbers Pr(fc|P A M) for all k, P and M. For fixed 
P and M, the mutually exclusive and exhaustive prop- 
erty of K implies J^k P*(tyP A M) = 1. An operational 
theory is then a specification (V,M,K,{Pr(k\P AM)}). 

Quantum theory is an example of an operational the- 
ory where each preparation P 6 V is associated with a 
density operator pp £ B(%) via the mapping p : V — > 
D("H) (for a explanation of the notation used in this arti- 
cle, see the Appendix). In general, this mapping is not re- 
quired to be injective or surjective; different preparations 
may lead to the same density operator and there may not 
exist a preparation which leads to every density opera- 
tor. Similarly, each measurement M G M and outcome 
k € K is associated with an effect Eu,k & E('H) via the 
mapping E : M. X K — > E(%). Again this mapping need 
not be injective or surjective. Quantum theory prescribes 
the probability distributions Pr(/c|P A M) = T[(p P E M k ) 
which is called the Born rule. Since if is a set of mutu- 
ally exclusive and exhaustive possibilities, for fixed M, 
{E M ,k} G POVM(H). 

Notice that an operational theory only specifies the 
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probabilities given the preparations in the set V . Sup- 
pose, for example, we are told a coin is tossed which 
determines which of two preparations procedures are im- 
plemented in a laboratory experiment. The operational 
theory does not offer the probability of a given outcome 
conditional on this information. However, the laws of 
probability provide us with the tools necessary to derive 
the desired probabilities. In general, the task is, given 
any disjunction \A Pi of propositions (the set of which 
is denoted D(V)) from the set of preparations V, deter- 
mine the probability distribution Pr(fc| \A Pi AM). Since 
the set of preparations is mutually exclusive, the laws of 
probability dictate 

Pr(fc| V, Pi A M) = Pr(k\Pi A M) Pr(P 4 ). (1) 



Similarly, for any disjunction \J . Mj of propositions from 
the set of measurements A4 



Pr(fc|P A Vj Mj) = Pr(k\P A Mj)Vx(Mj 



(2) 



Putting Equations ([I]) and ^ together yields 

Pr(fc| V, Pi A V/j Mj) = Pr ( fc l^ A Mj) P <P) p < m j)- 

(3) 

Consider the example of quantum theory again. Let 
Vj Pi be an arbitrary disjunction of preparations from 
V. Then for fixed k and M 

Pr(fc| Vj Pi AM) = ^ Pr(k\Pi A M) Pr(P 2 ) 

i 

= ^Tr(pp^ M ,fc)Pr(P 4 ) 



Tr (^2-PT(Pi)p Pi E M ,k 



suggesting we define the function p : D(V) —> B(W) 

p--\l i Pi^Y,i^{Pi)PP i - (4) 
Similarly, let us define a function E : D{M) x]{4 E(7i) 



E:(\J j M j ,k)^j: j Px(Mj)E Mj , k 



(5) 



Now, using p and E and defining Pjj := \/. Pj and = 
V • Mj we have 



Pr(fc| Vi P A V, Mj) = Tr(p PD E MD<k ). 



(6) 



This works for the probability distributions of any opera- 
tional theory and hence the set (D(V), D(M), K, H, p, E) 
is an operational theory itself. Unless specified oth- 
erwise we will simply assume this analysis has been 



done when given an operational theory. This is equiv- 
alent to assuming the p and E of an operational theory 
(V,M.,K,'H,p,E) have ranges which are convex sets. 

An ontological model is an attempt at interpreting an 
operational theory as an effectively epistemic theory of 
a deeper model describing the real state of affairs of the 
system. An ontological model posits a set A of mutually 
exclusive and exhaustive ontic states A. Each prepara- 
tion P is assumed to output the system in a particular 
ontic state A. However, the experimental arrangement 
may not allow one to have knowledge of which state was 
prepared. This ignorance is quantified via a conditional 
probability distribution Pr(A|P). When this probability 
is viewed as a mapping V — > Prob(A), from prepara- 
tions to the set of probability distributions on A, it is 
neither injective or surjective. That is, in general, dif- 
ferent preparation may give the same probability distri- 
bution of A and, certainly, not all probability distribu- 
tions are realized. A measurement M may not deter- 
ministically give an outcome k which reveals the state 
A of the system. Each measurement can then only be 
associated with an conditional probability distribution 
Pr(fc|M A A) e Prob(M). Again, when this probability 
is viewed as a mapping Mxlf-4 Prob(M) it is not as- 
sumed to be injective or surjective. An ontological model 
also requires that A be such that knowledge of A renders 
knowledge of P irrelevant. In the language of probability, 

CI k is conditionally independent of P given A. 

Summarizing, an ontological model is a set 
(P,Al,.fs:,A,{Pr(A|P)},{Pr(£;|M A A)}) such that 
CI holds. As a consequence of CI, the law of total 
probability states 

Pr(fc|P A M) = ^ Pr(k\M A X) Pr(A|P). (7) 

A 

Loosely speaking, non-contextuality encodes the prop- 
erty that operationally equivalent procedures are repre- 



sented equivalcntly in the ontological model (Spckkcns 



2005). Two preparations are indistinguishable opera- 



tionally if no measurement exists for which the proba- 
bility of any outcome is different between the two. A 
ontological model is non-contextual (with respect to it 
preparations) if the probability distributions over the on- 
tic space the operationally equivalent preparations pro- 
duce are equal. Similarly, measurements can be oper- 
ationally equivalent and the ontological model can be 
non-contextual with respect to them. A mathematically 
concise definition of non-contextuality is as follows. 

Definition 1. Let (V,M, K, A, {Pr(A|P)}, {Pr(fc|M A 
A)}) define an ontological model. 

(a) Let P,P' G V . The ontological model is preparation 
non-contextual if for all k £ K, M £ M and A € A 

Pr(fc|P A M) = Pr(fc|P' A M) =>> Pr(A|P) = Pr(A|P'). 
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(b ) Let M, M' E A4 . The ontological model is measure- 
ment non-contextual if for all k E K , P € V and 
A e A 

Pr(fc|PAM) = Pr(fe|PAM') Pr(fc|MAA) = Pr(fc|Af AA). 

(c,l T/ie model is simply called a non-contextual ontolog- 
ical model if it is both preparation and measurement 
non-contextual. 



B. Quantum theory, contextuality and quasi-probability 

Recall that quantum theory as an operational model 
is defined via the mappings p and E. As an example, 
consider only those preparations associated with pure 
states. These pure states are determined by a mapping 
p =: V : V ->■ P(7Q. The model of Be ltrametti and 
Bugajski ( |Beltrametti and Bugajski 1995) posits the on- 
tic space A = P(H) and a sharp probability distribution 



Pr(A|P)dA = 5(X- tp P )dX 



(8) 



for each preparation P. This model suggests that the 
quantum state provides a complete specification of re- 
ality. Recall each measurement procedure is associated 
with a POVM via the mapping E : M x K ^ E(H). 
Each measurement procedure M implies a conditional 
probability 



Pr(fc|Af A A) = (X,E M , k X). 



(9) 



To show this is an ontological model, it remains only to 
verify that Equation ^ is satisfied. It follows that 

Pr(fc|PAAf) = / Pr(fc|A/A A)Pr(A|P)dA 

(X,E M , k X)S(X-^ P )dX 
= (ip P ,EM,kipp), 

which is the Born rule for pure states. The Beltrametti- 
Bugajski model is an ontological model for pure state 
quantum theory which is preparation non-contextual. 
However, the range of the mapping if) in this case is not 
convex. As we will now see, if we looked at all possible 
logical disjunctions of preparations in this model, so that 
range of the new mapping p is D(H), quantum theory ad- 
mits no non-contextual model. The most famous exam- 
ple of an ontological model for pure state quantum theory 
is the de Broglie-Bohm (or Bohmian) theory, which suf- 
fers the same flaw of admitting no non-contextual exten- 
sion to the full (i.e. including density operators) quantum 
theory. This model is discussed later in section [VI. B| 

Lemma 1. Suppose the convex range of E con- 
tains a basis (for W(H)) and the ontological model 



{T>,M,K,A,{Pv(X\P)},{Pv(k\M A A)}) is preparation 
non-contextual. Then, there exists a affine mapping (with 
domain the range of p) p: Ran(/j) — > Prob(A) satisfying 
p(X\p P ) = Pr(X\P). 

Proof. For equivalent disjunctions of preparations 

Tr{ppE M .k) = Pr(fe|PAM) = Pr(fc|P'AM) = Tr(p P ,E Mik ). 

(10) 

Since the range of E is a basis it spans H.(H) and there- 
fore pp = pp> if and only if P and P' are operationally 
equivalent. Then, from the definition of preparation non- 
contextuality, Pr(A|P) ^ Pr(A|P') implies pp ^ pp< . 
Thus the mapping p(X\pp) = Pr(A|P) is well-defined. 
Now we show the mapping p is affine. That is, 

p(X\pp P + (1 -p)pp>) =pp{X\p P ) + (1 -p)fj,(X\p P >) 

for all p € [0,1] and all P, P' E D{V). There is some 
P" E D(P) such that p P » = pp P + (1 - p)pp'. That is 
to say, P" = P V P' while Pr(P) = p and Pr(P') = 1 -p. 
From the non-contextuality assumption this implies 

Pr(A|P") = Pr(A|PVP') 

= Pr(P) Pr(A|P) + Pr(P') Pr(A|P'). 

Applying the definition of p to this yields 



p(X\p P ») = pp(X\p P ) + (1 - p)p(X\p P ,), 
which proves p is affine. 



□ 



Lemma 2. Suppose the range of p contains 
a basis (for W((H) ) and the ontological model 
{V,M,K,A, {Pr(A|P)}, {Pr(£;|M A A)}) is measurement 
non-contextual. Then, there exists a unique affine map- 
ping £ : E{H) ->■ (K) satisfying Pr(fc|Af A A) = £(E Mik ). 

The proof of Lemma [2] is similar to that of Lemma 
[T] Together, the mappings p and £ are called a classical 
representation of quantum theory. 

Lemma 3. A classical representation satisfies, for all 
X E A, all p E V>(H) and all E E E(H), 

(a) p p (X) E [0,1] andJ2\V P W = l , 

(b) £ £ (A) € [0,1] and £i(A) = I, 

(c) Tr(pE) = ^ x p p (X)^ E (X). 



In Reference (Ferrie and Emerson 2008) the name 



"classical representation" was defined to be a set of 
mappings satisfying (a)-(c). Through Lemmas [l] and [2] 
we have shown that the assumption of non-contexuality 
guarantees that these mappings are well defined, convex 
linear and satisfy (a)-(c). However, regardless of how we 
choose to arrive at a pair of convex linear mappings sat- 
isfying (a)-(c), one (or more) of the assumptions we make 
will be false as shown by the following theorem. 
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Theorem 1. A classical representation of quantum the- 
ory does not exist. 

This theorem is implied by the equivalence of a clas- 
sical representation and a non-contextual ontological 



model of quantum theory (Spekkens 2008) and the im 



possibility of the latter ( Spekkens 2005 ) . This theorem 



was proven in reference ( Ferrie and Emerson 2008 ) using 



the notion of a frame and its dual, which we will see later. 
A direct and more intuitive proof was given in reference 



( Ferrie and Emerson 2009 1 . The theorem is also implied 



by an earlier result on a related topic in reference (Busch 



et al. 1993 1 



Now let us relax some assumptions in order to avoid 
a contradiction. First, we will relax the assumption of 
non- context uality. 

Definition 2. A contextual representation of quantum 
theory is a pair of mappings p : '0{'H) x Cp — > -P(A) and 
£ : E(H) x C M -> -P(A) which satisfy, for all A G A, 
all p G H(H) and all E e E(H), all cp e C-p, and all 
cm € Cm, 

(a) fip, Cv (X) € [0,1] and X) A £Wt> M = 1 > 

(b) £, E ,c M (A) G [0, 1] and £ 1)CjM (A) = I, 

(c) Tr(pE) =Ea/W(A)£WA). 

Here C-p and Cm o- r ^ the preparation and measurement 
contexts. 

To the best of our knowledge such a construction does 
not exist in the literature. In the previous section we 
looked at the Beltrametti-Bugajski modej^] which was 
non-contextual for pure states. The model cannot be 
extended to include all states such that it remains non- 
contextual. We could artificially create a preparation 
contextual model by assigning a unique context to each 
convex decomposition of every density operator. It is 
unclear whether such a construction could be useful. It 
is more common, however, is to relax the assumption of 
non-negative probability rather than non-contextuality. 

Definition 3. A quasi-probability representation of 
quantum theory is a pair of affine mappings p : B("H) — >• 
L(A) and £ : E(%) -> L(A) which satisfy, for all A G A, 
all p G D(H) and all E G E(H), 

(a) p p (X) G K and J2\ Mp(A) = 1, 

(b) &j(A) G R and &(A) = 1, 
f C ;iHp£) = £ A/v (A)6?(A). 



It is immediately clear that theorem [T] is equivalent to 
the following "negativity theorem" : 

Theorem 2. A quasi-probability representation of quan- 
tum theory must have negativity in either its representa- 
tion of states or measurements (or both). 

There are many instances of mappings p satisfying 



the first requirement. In reference (Ferrie and Emerson 



2008) it was shown by construction how to find a map- 



ping £ which, together with p, satisfy all of them. Most of 
the mappings p are called phase space functions as they 
conform to added mathematical structure not required 
in definition [3] A phase space representation is then a 
particular type of quasi-probability representation which 
we formally define as follows: 

Definition 4. If there exists a non-trivial symmetry 
group on A, G, carrying a unitary representation U : 
G — > U('H) and a quasi-probability representation satis- 
fying the covariance property U g pU^ h- > {p p (g(X))}\^A 
for all p G D("H) and g G G, then p t— > p p (X) is a phase 
space representation of quantum states. 

It will turn out that all but a few of the representations 
reviewed below satisfy this stronger criterion. 



III. QUASI-PROBABILITY IN INFINITE DIMENSIONAL 
HILBERT SPACE 

Here we will review the quasi-probability distributions 
which have been defined for quantum states living in an 
infinite dimensional Hilbert space - the canonical example 
being a particle moving in one dimension. Since there are 
a myriad of excellent reviews of the Wigner function and 
other phase space distributions [^] our discussion of them 
will be brief. We will mainly focus on those details which 
have inspired analogous methods for finite dimensional 
Hilbert spaces. 

First we start with the familiar Wigner function in 



section III. A The other phase space distributions, such 



as the Husimi function, are bundled up in section III.B 



A. Wigner phase space representation 

The position operator Q and momentum operator P 
are the central objects in the abstract formalism of infi- 
nite dimensional quantum theory. The operators satisfy 
the canonical commutation relations 

[Q,F]=i. 



3 See reference jHillery et al. |[l984fr for a classic and reference ( |Lee| 
2 These comments apply equally to other ontological models for |1995|l for a more recent review of phase space quasi-probability 

pure states, such as the de Broglie-Bohm model. distributions. 
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We are looking for a joint probability distribution p p (p, q) 
for the state of the quantum system. From the postulates 
of quantum mechanics we have a rule for calculating ex- 
pectation values. In particular, we can compute the char- 
acteristic function 

jfar,) ;= / e <tt9+w)^ = Tr(e j(5Q+ " p) p)- 

Since the characteristic function is just the Fourier trans- 
form of the joint probability distribution, we simply in- 
vert to obtain 



fi> P (p, q) 



1 



(11) 



which is the celebrated Wigner {unction of p (Wigner 



1932 ) . The Wigner function is both positive and negative 



in general. However, it otherwise behaves as a classical 
probability density on the classical phase space. For these 
reasons, the Wigner function and others like it came to 
be called quasi-probability functions. 

The Wigner function is the unique representation sat- 
isfying the properties ( |Bertrand and Bertrand 1987) 



Wig(l) For all p, p p (q,p) is real. 
Wig(2) For all pi and p 2 , 

Tr(pip 2 ) = 2tt / dqdp p Pl (q,p)p P2 (q,p). 

JR 2 

Wig(3) For all p, integrating p p along the line aq + bp = c 
in phase space yields the probability that a measure- 
ment of the observable aQ + bP has the result c. 

We can write the Wigner function as 

V P (p,q) = Tr[F(p,g)p] , 

where 

F ^P)~7^I dtdr,eW-'» +i * p -*\ (12) 
[2-kY J R 2 

Thus the properties Wig(l)-(3) can be transformed into 
properties on a set of operators F(q,p) which uniquely 
specify the set in Equation dl3|). These properties are 



Wig(4) F(q,p) is Hcrmitian. 

Wig(5) 2^r{F(q,p)F{q',p')) = 5{q-q')8{p-p'). 

Wig(6) Let P c be the projector onto the eigenstate of aQ+bP 
with eigenvalue c. Then, 

/ dqdp F(q,p)5(aq + bp — c) = P c . 
Jr 2 

These six properties are often the basis for generalizing 
the Wigner function to Unite dimensional Hilbert spaces, 
as we will soon see. 



1. Applications: quantum teleportation 

The applications of the Wigner function are far reach- 
ing and not limited to to physi cs [^| A modern applica- 



tion can be found in reference (Caves and Wodkiewicz 



2004 ) where Caves and Wodkiewicz use the Wigner func- 



tion to obtain a hidden variable model of the continuous- 



variable teleportation protocol (Braunstein and Kimble 
19981 iVaidmanl 119941). Later, in section lIV.Fl we will 



discuss the much simpler discrete-variable teleportation 
protocol. Here, then, we will avoid the details of the 
protocol and focus on the result. It suffices to know the 
following: there are three quantum systems; the goal of 
the protocol is to transfer a quantum state from system 1 
to system 3; the transfer is mediated through the special 
correlations between system 2 and system 3. The suc- 
cess of the protocol is measured by the average fidelity: 
a measure of the closeness of the initial state p = IV'XV'I 
of system 1 and the average final state p on t of system 3. 

Following Caves and Wodkiewicz we define v = q + ip 
and index the Wigner function as p p (y). This is con- 
venient since the protocol is tailored to a quantum opti- 
cal implementation where the outcomes of measurements 
are usually expressed as complex numbers. Initially, the 
state of system 2 and 3 is described by the joint Wigner 
function ^2,3(0:, 

In terms of the Wigner functions, the average fidelity 

is 

T = * J d 2 vd 2 Pp p {v)p Po Jfi). 

This intuitively measures closeness by quantifying the 
overlap of the Wigner functions on the classical phase 
space. The output state, determined by the details of 
the protocol, is 

VpautiP) = J d 2 vd 2 ap p {(3 - v)p,2,3( a > u 

The initial Wigner function p p {v) and the joint Wigner 
function P2,?,{ot,f3) are determined by the particular im- 
plementation of the protocol. The standard quantum 
optical implementation is done using coherent states of 
light. It is easy to show that such states have positive 
Wigner functions^} Thus, the Wigner function provides 
a classical phase space picture of the entire protocol. 

A first step toward performing an experiment requiring 
genuine quantum resources might be to avoid the above 
classical description by teleporting a non-coherent quan- 
tum state. Caves and Wodkiewicz have devised a classi- 
cal explanation for this case as well. The new model in- 
volves a randomization procedure which transforms the 



For example, see reference jDragoman] |2005| for a recent review 
of the applications of the Wigner function in signal processing. 
5 More difficult is to show that coherent states are the only states 
with positive Wigner functions | |Hudson[ |1974^ . 
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initial non-coherent state into a coherent one thus giv- 
ing it a positive Wigner function. However, it can be 
shown that within such a model, the fidelity is bounded: 
J 7 < 2/3. So 2/3 emerges as a "gold-standard" since tele- 
porting a non-coherent state with fidelity T > 2/3 avoids 
this classical phase space description. 



where cPa — (l/2)dqdp. Notice that this immediate im- 
plies the following duality condition between the P- and 
Q-representation: 

Tr(pf/)= J d 2 aP p (a)Q p ,(a). (18) 

These functions and further discussed in section IVI.AI 



B. Other phase space representations 

Another class of solutions to the ordering problem is 
the association e^ 9+JI?p h- > e l ^ +lr,p '/(£, rf) for some arbi- 
trary function / [^J 

Consider again the classical particle phase space M 2 
and the continuous set of operators 

F(Q,P) ■■= 7^2 [ dSdneVto-V-Hito-Vf&r)). (13) 
{2ttY J r2 



The / dependent distributions 



(14) 



define quasi-probability functions on phase space alter- 
native to the Wigner function, which is simply the / = 1 
special case of this more general formalism. 

Besides the Wigner function, the most popular choices 
of / are 

/(£,??) =e ± ^ 2 +A 



which give, via equation (14), the Glauber-Sudarshan 



(Glauber 1963 Sudarshan 



1963) and Husimi (Husimi 



1940) functions, respectively. These two mappings are 



sometimes referred to as the P- and Q-representations 
(not to be confused with position and momentum rep- 
resentations). We will follow the usual convention by 
introducing the annihilation operator 



a=^(Q + iP) 



(15) 



and the coherent states defined via a\a) — a\a), where 
we write a = q + ip. Then the Husimi function can be 
conveniently written 



f 1 

Q(a) := p T p {q,p) = -(a, pa). 

7T 



(16) 



The Glauber-Sudarshan function p — > P(a) can be ex- 
pressed implicitly through the identity 



(17) 



p= Id aP(a)\a)(a\, 



1. Application: quantumness witness 

Here we will discuss a more recent application of the P 
and Q functions of interest in quantum information and 
foundations Q We will concern ourself with a particular 
notion of "non-classicality" defined in reference (Alicki 
et al.\ |2008[ | Afield and Ryn[ |2008| |Korbicz et al.\ |2005[ ). 

Consider two observables represented by the self- 
adjoint operators R and S. Another observable W(R, S) 
written as an ordered power series of R and S is a quan- 
tumness witness if it possesses at least one negative eigen- 
value and the function w(r, s) obtained by replacing R 
and S with its spectral elements is positive: w(r, s) > 
for all r £ spec(i?) and s £ spec(5') (spec(-) denotes the 
spectrum of the observable) . 

Now consider R = Q and S = P. the usual position 
and momentum operators. Recalling the parameteriza- 
tion in terms of the annihilation operator in equation 



( 15 1, we define 



W{a) 



E 



for any choice of coefficients c mn , such that 
w(a) = c mn a m a n > 

and W possesses at least one negative eigenvalue (an 
example of such a construction is W(a) = (a)) 2 a 2 — 
2ma'a + m 2 for m > 1). Then, in terms of the Q and 
P-representation defined above, we have 

(W) = Tr(pW) = I d 2 a(a 7 Wa,P){a) = / d 2 aw(a)P{a) 



(19) 

In optics especially, coherent states are considered clas- 
sical. From equation (17) we see that if P(a) > 0, the 



quantum state is a statistical mixture of coherent states 
and hence just as classical. So if p is a classical state, 



equation ( 19 ) tells us (W) > 0. Therefore, if we measure 



(W) < 0, we can rule out the classical model of statistical 
mixtures of coherent states; we can say W(a) detects the 
quantumness of the states. 



6 This is very closely related to the s-ordered Cahill-Glaubcr for- 
malism jCahill and Glauber| [1969) . See Table 1 of jLee| |1995) 
for a concise review of the traditional choices for /. 



7 The P and Q functions are powerful visualization tools promi- 
nently used in the areas of quantum optics and quantum chaos 
| |Takahashi||1989[ |. See ( |Schleich| |20u"[) for an overview of the 
applications in quantum optics. 



IV. QUASI-PROBABILITY IN FINITE DIMENSIONAL 
HILBERT SPACE 

Nearly all definitions of quasi-probability distributions 
for finite dimensional Hilbert spaces have been motivated 
by the Wigner function. The earliest such effort was 
by Stratonovich and is reviewed in section |IV.A| The 
Stratonovich phase space is a sphere and hence contin- 
uous. Later, many authors have define Wigner function 
analogs on discrete phase spaces. A sampling, with a bias 
towards those which have found application in quantum 
information theory is given in sections |IV.B||IV7F| 

There also exist quasi-probability distributions which 
where introduced to solve various problems far re- 
moved from proposing a finite dimensional analog of the 
Wigner function. Sections |I V . G||TvTj| review those quasi- 
probability distributions which do not have a canonical 
phase space structure and hence form a somewhat weaker 
analogy to the Wigner function. 

We note that there exists many other quasi-probability 
distributions defined on discrete phase spaces which 



The continuous set of operators A(n) is called a kernel 
and we note it plays the role of the more familiar phase 
space point operators in the latter. Requiring that Equa- 



tion (20 1 hold changes the postulates to new conditions 



on the kernel 



dn A(n) = 1. 



A(n)t=A(n), 
d 

47T J §2 

/ dnTr(A(n)A(m))A(n) = A(m), 
A( 5 -n) = [/ g A(n)/yt, g € SU(2). 



(21) 
(22) 

(23) 
(24) 



These postulates are the spherical analogies of properties 
Wig(4)-(6) (again, modulo the normalization condi tion). 
Defining s = ^_as the spin, Heiss and Weigert ( Heiss 
and Weigert 2000) provided a concise derivation of 2^ s 
unique kernels satisfying these postulates They are 



2 s 



are not reviewed here (Chaturvedi et al. 2005 2006 


Cohen and Scully |1986 


Feynman 1987| Galetti and 


de Toledo Piza "]l988 


Gross 2006 2007 Hakioglu 


1998[ |Hannay and Berry 


1980[ (Luis and Perina[ |1998 


Mukundal |2004| |Opatrny et aLl|1996||RivasH1999[|Vour- 


das 2004). 



A(n) = E E 



m= — s 1=0 



21 + 1 

2s~+i 



C^ s m <Mn)C(n), (25) 



where C denotes the Clebsch-Gordon coefficients. Here, 
4> m (n) are the eigenvectors of the operator S • n, where 
S = (X, Y, Z); and e; = ±1, for I = 1 ... 2s and e = 1. 



Heiss and Weigert relax the postulates Equations ( 2f ) 



A. Spherical phase space 



(24) on the kernel A(n) to allow for a pair of kernels A n 
and A m . The pair individually satisfy Equation (21), 



while one of them satisfies Equation ( 22 ) and the other 



Here we will be concerned with a set of postulates put Equation (|24j). Together^the pair must satisfy the gen- 



forth by Stratonovich dStratonovichl (19571). The aim of eralization of Equation (23] 



Stratonovich was to find a Wigner function type map- 
ping, analogous to that of a infinite dimensional system 
on K 2 , of a d dimensional system on the sphere S 2 . The 
first postulate is linearity and is always satisfied if the 
Wigner functions on the sphere satisfy 



d_ 

in 



[ dnTr(A n A m )A n = A r 



(26) 



A pair of kernels, together satisfying Equation (26), is 
given by 



/x p (n) = Tr(pA(n)), 



(20) 



where n is a point on S 2 . The remaining postulates on 
this quasi-probability mapping are 

M n )* = Mp(n), 
d f 

-j s2 dn^ p (n) = l, 
d f 

— dn p Pl (n)p P2 (n) = Tr(pip 2 ), 

47T J S 2 

/i (s . p) (n)=/x p (n)s, ge SU(2), 

where g ■ p is the image of U g pU^ and U : SU(2) — > 
VH is an irreducible unitary representation of the group 
SU(2). These postulates are analogous to Wig(l)-(3) for 
the Wigner function modulo the second normalization 
condition (which could have be included in the Wigner 
function properties). 



m— — s 1=0 
s 2s 



2s + l 



m mVm 



»</4( n ) 



A n = E E 



7z 



m=—s 1=0 



■ i 21 + 1 
2s + 1 



Cm mfim 



where each ji is a finite non-zero real number and 70 
The original postulates are satisfied when 7; = ■y^ 1 e 



= 1. 



The major contribution of reference (Heiss and 



Weigert 2000) is the derivation of a discrete kernel 



A,, := A n , for v = 1 . . . d 2 which satisfies the discretized 



8 This was also shown earlier jBrif and Mann| |1999| |Varilly and 
Graciabondia"| |1989[) - s ee also references | |Arecchi et al.\ |1972| 
|Scully and W6dkiewicz||1994[ |. 
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postulates 



At = A„, 

j ^ Tr(Aj,A M )A„ = A M , 



//=1 



A 9 .„ = C/gA.C/t 5 e SU(2). 



(27) 
(28) 

(29) 
(30) 



where a = (X,Y,Z) are the usual Pauli operators. In 
NMR experiments many qubits are employed to perform 
quantum information tasks such as error correction and 
teleportation. Suppose there are n qubits with total 
Hilbert space dimension 2™. We choose an n-fold ten- 
sor product of the qubit kernels. Explicitly, they are 



1 

7=1 



A n = 



The subset of points n„ is called a constellation. The lin- 
earity postulate is not explicitly stated since it is always 
satisfied under the assumption 



1 

(47r) r 



)(l + 3n-a). 



(32) 
(33) 



i=i 



p -> ii p {y) = Tr(pA^) 



(31) 



Equation (291 is called a duality condition. That is, it is 



only satisfied if A v and A p are dual bases for H("H). In 
particular, 



The quasi-probability function is given by 

Mp (n)=Tr(pA n ). 

As expected, in general, this function is both positive and 
negative. 

The quantum state of an NMR experiment is of the 
form 



1 



Tr(A„A M ) = 5, 



P=(l-e)^l + ePl, 



(34) 



Although the explicit construction of a pair of discrete 



kernels satisfying Equations ( 27 1-( 30 ) might be computa- 



tionally hard, their existence is a trivial exercise in linear 
algebra. Indeed, so long as A v is a basis for W(H), its 
dual, A M , is uniquely determined by 



where the Gram matrix G is given by 



where p\ is arbitrary but often chosen to be a specific 
pure state. The parameter scales as 



So we have 

It is easy to determine the lower bound 

1 - e 62 2 "- 1 



Mp(n) > 



(47r) n (47r) n 



The authors of reference (iHeiss and Weigertl 120001) note Thus, provided 



that almost any constellation leads to a discrete kernel 
Aj, forming a basis for M(T~L). The term almost any here 
means that a randomly selected discrete kernel will form, 
with probability 1, a basis for W(H). 



1. Application: NMR quantum computation 

The spherical quasi-probability functions for qubit sys- 
tems (d — 2) were put to use by Schack and Caves for 
the purpose of obtaining a classical model of nuclear mag- 
netic resonance (NMR) experiments designed to perform 
quantum information tasks (Schack and Caves 19991. 



e < 



1 



1 + 2 2 ™" 1 : 



/ i p( n ) > and we have a representation of NMR quan- 
tum states in terms of classical probability distributions 
on a classical phase space. Note however, that this defini- 
tion of "classical" omits the reasonable requirement that 
it also be efficient. Indeed, the authors note that, for 
typical experimental values of the scaling parameter in e, 
such a classical representation is valid for n < 16 qubits. 
The spherical phase space representations have also 



been put to good use in visualizing decoherence (Ghose 



For a single qubit we choose the kernels 



1 



et al. 2008) and photon squeezing (Shalm et al. 2009). 



A n = -(l+n-a), 
A==^(l + 3n.<7), 



B. Wootters discrete phase space representation 



In reference (Wootters 1987), Wootters defined a dis- 



crete analog of the Wigner function. Associated with 
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each Hilbert space H of finite dimension d is a discrete 
phase space. First assume d is prime. The prime phase 
space, $, is a d x d array of points a = (q,p) € x Z^. 

A fee, A, is the set of d points satisfying the linear 
equation aq + bp — c, where all arithmetic is modulo d. 
Two lines are parallel if their linear equations differ in 
the value of c. The prime phase space $ contains d + 1 
sets of d parallel lines called striations. 

Assume the the Hilbert space H has composite dimen- 
sion d = d\di • ■ ■ dk- The discrete phase space of the en- 
tire d dimensional system is the Cartesian product of two- 
dimensional prime phase spaces of the subsystems. The 
phase space is thus a (d\ x di) x (d 2 x d 2 ) x • • ■ (xd fc x dk) 
array. Such a construction is formalized as follows: 
the discrete phase space is the multi-dimensional array 
$ = $iX$ 2 X'''X$t, where each $^ is a prime phase 
space. A point is the /c-tuple a — (ai, «2> . . . , of 
points ai = (qi,Pi) in the prime phase spaces. A line 
is the fc-tuple A = (Ai, A2, . . . , \k) of lines in the prime 
phase spaces. That is, a line is the set of d points satis- 
fying the equation 

(a 1 q 1 +b 1 p 1 ,a 2 q 2 +b2P2, a k qk+bkPk) = (ci, c 2 , . . . , c fc ), 

which is symbolically written aq + bp = c. Two lines are 
parallel if their equations differ in the value c. As was 
the case for the prime phase spaces, parallel lines can be 
partitioned into sets, again called striations; the discrete 
phase space $ contains (di + l)(d 2 + 1) ••■ (dk + 1) sets 
of d parallel lines. 

The construction of the discrete phase space is now 
been complete. To introduce Hilbert space into the dis- 
crete phase space formalism, Wootters chooses the fol- 
lowing special basis for the space of Hermitian operators. 
The set of operators {A a : a € <!>} acting on an d di- 
mensional Hilbert space are called phase point operators 
if the operators satisfy 



Woo(4) For each point a, A a is Hermitian. 
Woo(5) For any two points a and /3, Tv(A a Ap) 



Although the sets of phase point operators are unitarily 
equivalent, the induced labeling of the lines associated to 
the chosen set of phase point operators are not equivalent. 
This is clear from the fact that unitarily equivalent PVMs 
do not project onto the same basis. 



The choice of phase point operators in reference ( Woot- 



ters 



1987) will be adopted. For d prime, the phase point 



operators are 



A, 



j,m—0 



(35) 



where u is a d'th root of unity and X and Z are the 
generalized Pauli operators (see appendix [A]). For com- 
posite d, the phase point operator in $ associated with 
the point a — (aj., a 2 , . . . , a^) is given by 



A n — A n 



A r 



■A r 



(36) 



where each A a . is the phase point operator of the point 
oti in $j. 

The d 2 phase point operators are linearly independent 
and form a basis for the space of Hermitian operators 
acting on an d dimensional Hilbert space. Thus, any 
density operator p can be decomposed as 

where the real coefficients are explicitly given by 



V>p(Q,P) = ^Tr(pA(g,p)). 



(37) 



This discrete phase space function is the Wootters dis- 
crete Wigner function. This discrete quasi-probability 
function satisfies the following properties which are the 
discrete analogies of the properties Wig(l)-(3) the origi- 
nal continuous Wigner function satisfies. 



dS, 



a/3- 



Woo(6) For each line A in a given striation, the operators 
P\ = 2 A a form a projective valued measurement 

(PVM): a set of d orthogonal projectors which sum 



Woo(l) For all p, p p (q,p) is real. 
Woo(2) For all p\ and p 2 , 

Tr(pip 2 ) = dy^p Pl (q,p)p P2 (q,p). 



to identity. 



Notice that these properties of the phase point operators 
Woo(4)-(6) are discrete analogs of the properties Wig(4)- 
(6) of the function F defining the original Wigner func- 
tion. This definition suggests that the lines in the discrete 
phase space should be labeled with states of the Hilbert 
space. Since each striation is associated with a PVM, 
each of the d lines in a striation is labeled with an or- 
thogonal state. For each $, there is a unique set of phase 
point operators up to unitary equivalence. 



Woo(3) For all p, summing p p along the line A in phase 
space yields the probability that a measurement of 
the PVM associated with the striation which con- 
tains A has the outcome associated with A. 



1. Application: entanglement characterization 



In (Franco and Penna 2006), Franco and Penna relate 
the negativity of Wootter's discrete Wigner function to 
entanglement. Recall that a bipartite density matrix p is 
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separable if it can be written as a convex combination of 
the form 

E(l) „ (2) 



for all k, where p"p and p^ 2 ' are states on the individual 



C. Extended discrete phase space 



In reference (Cohendet et al. 1988), Cohendet et al 



define a discrete analogue of the Wigner function which 
is valid for integer spin[^] That is, dim('H) = d is assumed 
to be odd. Whereas Wootters builds up a discrete phase 
space before defining a Wigner function, the authors of 



subsystems. Let <£>! and <& 2 be the DPS associated with ( jCohendet et al.\ 1988|) implicitly define a discrete phase 
p^ and respectively. 



The Wootters representation of a density matrix of the 
form p = p (1) (8ip (2) is given by p p (a) — p,y (ai)p,p (a 2 ), 
where a — (ai,ct2) € $i x $2- This can be shown as 
follows: 

Mp(«1i«2) = ^Tr(p {1) A ai ®p {2) A a2 ) 

= \ E ^\Pi)^\h)Tr{A^A ai 
/3ie*i,/3 2 e*2 

= M^ ) (« 1 )/if)(a 2 ). 

Thus, separability can be recast entirely in terms of the 
discrete phase space. That is, a discrete Wigner function 
is separable if it can be written 



space through the definition of their Wigner function. 
Consider the operators 

W mn <p k = w 2n(fc ~ m) 0fe_2m, 

with ra,n£Zj and 0^ are the eigenvectors of Z. Then, 
the discrete Wigner function of a density operator p is 



1. 



(39) 



p p (a) 



fe 



) fe /i^ 2) (a 2 )A 



(38) 



(1) 'a0/4 2) (a a ) 



1—/3 



for 



else it is entangled. 

The two qubit product state p p {a) = p p 
with /Xp (ai) = I for some a\ and p p 2 \c<2 / — , 

some «2 will have the most negative value for a separable 
1 Pk 

state, namely g v . Thus, if a two qubit Wigner func- 
tion has a value strictly less that 1 ~ g v/ ^ , it is entangled. 
Since entanglement is considered non-classical, negativity 
of the Wigner function (below some threshold) is asso- 
ciated with non-classicality. However, even if a Wigner 
function is positive on all of phase space, it can still be 
entangled. Therefore, Franco and Penna have found a 
new sufficient condition for entanglement in two qubits. 



Ap 2 A a2 ) 

where P is the parity operator. 

The authors call the operators A qp = W qp P Fano op- 
erators and note that they satisfy 

Tr(AgpZ\g/p/ ) — dSqq'Spp' 7 

x p—2k • 

The Fano operators play a role similar to Wootters' phase 
point operators; they form a complete basis of the space 
of Hermitian operators. The phase space implicitly de- 
fined through the definition of the discrete Wigner func- 
tion (39) is Id x Id- When d is an odd prime, this phase 
space is equivalent to Wootters discrete phase space. In 
this case the Fano operators are A qp = A(_ q p y This can 
seen by writing the Wootters phase point operators as 

A (ap) = -X 2q Z 2p Puj 2qp . 
1 ' d 

Let a € {±1}- The extended phase space is x x 
{±1}. Define the new Wigner function 



For a necessary condition, the authors of (Franco and 



Penna 2006) turn to the positive partial transpose con- 



dition (Horodecki 1997 Peres 1996). The result is a two 



qubit state p is separable if and only if both the discrete 
Wigner function of p and the discrete Wigner function of 
p 1 - 2 (the partial transpose) are non-negative everywhere 
on the discrete phase space. 

Wootters discrete Wigner function has also found ap- 



plication in quantum teleportation (Koniorczyk et al. 



2001). The authors have found the discrete phase space 



representation of the teleportation protocol much clearer 
especially when considering quantum systems with much 
larger than two dimensional Hilbert spaces. 



1 

4d 



+ o-p° dd (q,p) 



This function is satisfies the positivity and normalization 
requirements of a true probability distribution. 



1. Application: Master equation for an integer spin 

In the same paper, Cohendet et al show the quantum 
dynamical equation of motion can be represented in the 
extended phase space as a classical stochastic process. 



This difficulty was overcome in a later paper ^Cohendet et al\ 
[1990] . 
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This is achieved by showing the time evolution of the 
discrete Wigner function is 



d_ 
dt 



(j, p (q,p,a;t) 



q',p',cr' 



for a suitable choice of jump moments A. This is in 
the form of the master equation of a Markov process. 
The authors interpret this result as follows: "Quantum 
mechanics of an integer spin appears as the mixture of 
two classical schemes of a spin. However at random times 
the schemes are exchanged." 



D. Even dimensional discrete Wigner functions 



In reference (Lconhardt 1995), Leonhardt defines dis- 



crete analogues of the Wigner function for both odd and 
even dimensional Hilbert spaces. In a later paper ( Leon- 



hardt 1996), Leonhardt discusses the need for separate 
definitions for the odd and even dimension cases. Naively 
applying his definition, or that of Cohendet et al, of the 
discrete Wigner function for odd dimensions to even di- 
mensions yields unsatisfactory results. The reason for 
this is the discrete Wigner function carries redundant 
information for even dimensions which is insufficient to 
specify the state uniquely. The solution is to enlarge 
the phase space until the information in the phase space 
function becomes sufficient to specify the state uniquely. 

Suppose dim('H) = d is odd. Leonhardt defines the 
discrete Wigner function as 

V L o C °(q,P) = -,Tr(pX 2 ' 1 Z 2 PPu; 2 ' !P ). 
H d 

Lconhardt 's definition of an odd dimensional discrete 
Wigner function is unitarily equivalent to the Cohendet 
et al definition. That is, p^ co (q,p) = p° dd (-q,p). To 
define a discrete Wigner function for even dimensions, 
Leonhardt takes half-integer values of q and p. This 
amounts to enlarging the phase space to 'Lid x ^2d- Thus 
the even dimensional discrete Wigner function is 



/C en (<?,p) 



1 

2d 



Ti(pX q Z p Puj^), 



where the operators 



a even 

qp 



2d 



,X"Z*Pw 



could be called the even dimensional Fano or phase point 
operators. Of course, these operators do not satisfy all 
the criteria which the Fano operators (in the case of Co- 
hendet et al) or the phase point operators (in the case 
of Wootters) satisfy; they are not orthogonal, for exam- 
ple. Moreover, they are not even linearly independent 
which can easily be inferred since there are 4d 2 of them 
and a set of linearly independent operators contains a 
maximum of d 2 operators. 



1. Application: quantum computation 

Leonhardt's discrete Wigner function has been used to 
visualize and gain insights for algorithms expected to be 



performed on a quantum computer ( jBianucci et al. 2002 
Miquel et al. . 2002a|b ). For each step in a quantum algo- 



rithm the state p(t) of the quantum computer is update 
via some unitary transformation 

p(t+l) = Up(t)U*. 

This can be represented in the discrete phase space as 

H P (Q,P;t) =2ZZ{p,q\p',q')n p (q',p';t), 

q'p' 

where Z can be easily obtained from U. This resembles 
the update map for the probabilities of classical stochas- 
tic variables. However, the properties of Z imply that not 
all admissible maps are classical; they do not connect sin- 
gle points in phase space and hence are "nonlocal" . In 



reference ( |Miquel et al. 2002a) the authors identify a 
family of classical maps which can be efficiently imple- 
mented on a quantum computer. The authors admit that 
the ultimate usefulness of this approach is uncertain but 
speculate that the phase space representation may inspire 
improvement and innovation in quantum algorithms. It 
certainly makes for some inspiring pictures! 

The Leonhardt phase space formalism has also been 



applied to study decoherence in quantum walks (Lopez 



and Paz 2003). For large system, numerics are often 



employed to study the main features. The phase space 
method offers an intuitive and visual alternative. It al- 
lows one to visually see the quantum interference and its 
disappearance under decoherence. Related to these is a 
hybrid approach between the Wootters and Leonhardt 
discrete phase spaces used to analyze various aspects of 



quantum teleportation ( Paz 2002 ) 



E. Finite fields discrete phase space representation 

Recall that when dim('H) = d is prime, Wootters de- 
fines the discrete phase space as a d x d lattice indexed by 
the group Z^. In reference (Wootters 2004), Wootters 



generalizes his original construction of a discrete phase 
space to allow the d x d lattice to be indexed by a finite 
field F<j which exists if and only if d — p n is an integer 
power of a prime number. This approach is discussed at 
length in the paper ( Gibbons et al.\ 2004) authored by 
Gibbons, Hoffman and Wootters (GHW). 

Similar to his earlier approach, Wootters defines the 
phase space, as a d x d array of points a — (q,p) € 
Fd x Fd. A line, A, is the set of d points satisfying the 
linear equation aq+bp = c, where all arithmetic is done in 
Fd. Two lines are parallel if their linear equations differ 
in the value of c. 
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The mathematical structure of F^ is appealing because 
lines defined as above have the following useful proper- 
ties: (i) given any two points, exactly one line contains 
both points, (ii) given a point a and a line A not contain- 
ing a, there is exactly one line parallel to A that contains 
a, and (iii) two nonparallel lines intersect at exactly one 
point. Note that these are usual properties of lines in Eu- 
clidean space. As before, the d 2 points of the phase space 
<f>d can be partitioned into d + 1 sets of d parallel lines 
called striations. The line containing the point (q, p) and 
the origin (0, 0) is called a ray and consists of the points 
(sq, sp), where s is a parameter taking values in F^. We 
choose each ray, specified by the equation aq + bp — 0, to 
be the representative of the striation it belongs to. 

A translation in phase space, T a „ , adds a constant vec- 
tor, ceo — (<70jPo), to every phase space point: T ao a> — 
a + ao. Each line, A, in a striation is invariant under a 
translation by any point contained in its ray, parameter- 
ized by the points (sq,sp). That is, 



T (sq,sp) 



A = A. 



(40) 



field tracq '\ of any field element is given by 



tr(x) = ^ 



(42) 



i=0 



There exists a unique field basis, E = {eo, ...e n -i}, such 
that tr(e»iej) = 



We call E the dual of E. 



The construction presented in reference (Gibbons 
et al. 2004[ ) is physically significant for a system of n 
objects (called particles) having a p dimensional Hilbert 
space. A translation operator, T a associated with a point 
in phase space a = (q,p) must act independently on each 
particle in order to preserve the tensor product structure 
of the composite system's Hilbert space. We expand each 
component of the point a into its field basis decomposi- 



tion as in Equation ( 41 1 



n-1 
i=0 



(43) 



and 



The discrete Wigner function is 

where now the Hermitian phase point operators satisfy 
the following properties for a projector valued function 
Q, called a quantum net, to be defined later. 

GHW(4) For each point a, A is Hermitian. 

GHW(5) For any two points a and j3, Tv(A a Ap) = d8 a p. 

GHW(6) For any line A, ^ A » = dQ{\). 

The projector valued function Q assigns quantum states 
to lines in phase space. This mapping is required to 
satisfy the special property of translational covariance, 
which is defined after a short, but necessary, mathemat- 
ical digression. Notice first that properties GHW(4) and 
GHW(5) are identical to Woo(4) and Woo(5). Also note 
that if GHW(6) is to be analogous to Woo(6), the prop- 
erty of translation covariance must be such that the set 
{Q(A)} when A ranges over a striation forms a PVM. 

The set of elements E — {eo, e n -i} C F^ is called a 
field basis for F^ if any element, x, in F<j can be written 



P = ^Pifh 



(44) 



i=0 



with / any element of F<j. Note that the basis we choose 
for p is a multiple of the dual of that chosen for q. Now, 
the translation operator associated with the point (q,p) 
is 



n-1 



T {q,p) = 



(45) 



4=0 



Since X and Z are unitary, T a is unitary. 

We assign with each line in phase space a pure quan- 
tum state. The quantum net Q is defined such that for 
each line, A, Q(X) is the operator which projects onto 
the pure state associated with A. As a consequence of 



the choice of basis for p in Equation (44 1, the state as- 



signed to the line r Q A is obtained through 
Q{r a \)=T a Q{\)Tl 



(46) 



This is the condition of translational covariance and it 
implies that each striation is associated with an orthonor- 
mal basis of the Hilbert space. To see this, recall the 



property in Equation (40 1. From Equation (46), this im- 
plies that, for each s G F^, T( S q,sp) must commute with 
Q(A), where the line A is any line in the striation de- 
fined by the ray consisting of the points (sq, sp). That is, 
the states associated to the lines of the striation must be 



x = y^Xjej, (41) 

i=0 10 Note that we will distinguish the field trace, tr(-), from the usual 

trace of a Hilbert space operator, Tr(-), by the case of the first 

where each xi is an element of the prime field Z„. The letter. 
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common eigenstates of the unitary translation operator F. Discrete Cahill-Glauber formalism 

T( S q,sp), f° r each s £ F^. Thus, the states are orthogonal 
and form a basis for the Hilbert space. That is, their 
projectors form a PVM which makes GHW(6) identical 
to Woo(6) when d is prime. 



In reference (Ruzzi et al. 2005), Ruzzi et al have dis- 



In reference (Gibbons et al. 2004), the authors note 



that, although the association between states and ver- 
tical and horizontal lines is fixed, the quantum net is 
not unique. In fact, there are d d+1 quantum nets which 
satisfy Equation (40). When d is prime, one of these 



quantum nets corresponds exactly to the original discrete 
Wigner function defined by Wootters in Section [TV.B| 



1. Application: quantum computation 



As conjectured by Galvao (Galvao 20051, the authors 
of reference ( |Cormick et al. 2006 ) have shown the only 
quantum states having a non-negative discrete Wigner 
function £j are convex combinations of stabilizer states, 
which are simultaneous eigenstates of the generalized 
Pauli operators (Gottesman 1997). Working only with 



stabilizer states is "classical" in the sense that that one 
can represent them with only a polynomial number of 
classical bits whereas an arbitrary quantum state requires 



a exponential number of bits ( Nielsen and Chuang 2000 ) . 



Strengthening the connection between negativity and 
non-classicality, it was also shown that the unitary opera- 
tors preserving the non-negativity of the discrete Wigner 
function are a subset of the Clifford group, which are 
those unitaries which preserve Pauli operators under a 
conjugate mapping. According to the Gottesman-Knill 
theorem, a quantum computation using only operators 
from the Clifford group and stabilizer states can be 



efficiently simulated on a classical computer (Gottes- 



man 



1997). Thus, as noted in reference (Galvao 2005) 



for a particular computational model, negativity of the 
Wigner function is necessary for quantum computational 
speedup. A direct generalization of this result to continu- 



ous variables appears in reference ( Bartlett and Sanders 



2002) where the authors generalize the stabilizer states 



to the familiar Gaussian states used in quantum optics 
settings. 

This discrete Wigner function was also used to analyze 



quantum error correcting codes in reference (Paz et al. 
|2005[ ). The aim was to gain insights and intuition for 
various quantum maps by studying their pictorial repre- 
sentation in the discrete phase space. 



crctizcd the Cahill-Glauber phase-space formalism. The 
set of operators {S(rj, £)}, where 77, £ e [—1, 1] and I = 
(d odd), is called the Schwinger basis and explicitly given 
by 

v« 

These d 2 operators form an orthonormal basis for the 
space of linear operators. In analogy with the Cahill- 
Glauber formalism, the basis is generalized to 

where \s\ < 1 is any complex number and JC(rj,£) is a 
(relatively) complicated expression of Jacobi ^-functions 
(see the Appendix of reference ( Ruzzi et al. 2005[ )). Next 
we take the Fourier transform 



-(r)q+£p) 



>7,f=-i 

The set operators {T^ s \q,p)} is the discrete analog of 
the s-ordered mapping kernel of the Cahill-Glauber for- 



malism. Moreover, the authors of reference (Ruzzi et al. 



2005 ) have shown that the continuous limit of this set is 



indeed the Cahill-Glauber mapping kernel. 

Suppose s is real. Then , the operators {T^(q,p)} 
enjoy the following familiar properties: 

T«(g,p)t =TW( 9 ,p), 
Tr{T ( - s Xq,p)T^\q',pi))=d5 qq ,5 pp ,. 

Thus, similarly to the discrete kernel of Heiss and 
Weigert, {T^(q,p)} and {T<~- S \q,p)} are dual bases for 
the space of Hermitian operators. 

In the now familiar way, we can define a quasi- 
probability function on the (q,p) phase space as 



^(q,p)=Tr(T^(q,p)p). 



(47) 



Cahill and Glauber showed, for their s-ordered formal- 
ism, that s = corresponds to the Wigner function; 
s = 1 corresponds to the Husimi function; and s = — 1 
corresponds to the Glauber-Sundarshan function. Using 



equation (47), we call, for example, the function obtained 
when s — the discrete Wigner function. 



1. Application: quantum teleportation 



11 Note that it is assumed the discrete Wigner function is non- 
negative for all definitions - that is, for all quantum nets. 



Marchiolli et al have applied this formalism to quan- 



tum tomography and teleportation (Marchiolli et al. 



2005). The teleportation protocol was analyzed for ar- 
bitrary s but, for brevity, we will consider the s = 
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case (which is now assumed so the superscript can be ig- 
nored). The teleportation protocol utilizes entanglement 
to transfer a quantum state between two parties through 
the exchange of only a small amount of classical informa- 



tion (Nielsen and Chuang 2000). Consider the tripartite 
system 



P = P W 



,(2,3) 



where one party possess subsystem 1 and 2 and the other 
possess subsystem 3. The goal is for p^> to be transferred 
from subsystem 1 to subsystem 3 without simply swap- 
ping them. It is essential that the shared state p( 2 ' 3 ) 
be entangled. In particular, assume it is a maximally 



entangled pure Bell-state (Nielsen and Chuang 20001. 



We choose, following Wootters (Woottcrs 19871, to con- 



struct the global phase space to be a Cartesian product 
of the phase spaces of the individual subsystems. The 
discrete Wigner function of the whole system is then 

Pp (qi , <?2, 53! Pl,P2, Pi) = PpW Pl)PpV,V (<?2 , 93 5 P2 , P3 ) ■ 

A Bell-measurement is performed on the first two sub- 
systems, which in phase space is interpreted as a mea- 
surement of the total momentum and relative coordi- 
nate of the subsystem composed of subsystems 1 and 2. 
Marginalizing over subsystems 1 and 2 gives 

M P (3)(?3,P3) =/yi>(?3-a,P3 + /8), 

where a and j3 parameterize the result of the Bell- 
measurement (note p( 3 -* can be identified as the reduced 
state of subsystem 3). Thus, the final state of the sub- 
system 3 is simply a displacement in the phase space and 
communicating only the measurement result (a, f$) leads 
to recovery of the initial state. 

The discrete Husimi function (s = 1) was used to de- 



fine a discrete analog of squeezed states ( Marchiolli et al. 



2007 ) and to analyze spin tunneling effects in a particu- 



lar toy model of interacting fermions (Marchiolli et al. 



2009) 



G. Probability tables 

In 1986, before introducing the discrete Wigner func- 
tion, Wootters represented the quantum state as a "prob- 
ability table" which was simply a list of outcome prob- 



abilities for a complete set of measurements (Wootters 



1986). The complete set of interest was that of mutually 
unbiased bases (MUBs). We call n bases {ip%} mutually 
unbiased if they satisfy 



= 5kk>$nn' + ^(1 - <W)- (48) 



(Ivonovic 1981 1. Wootters also posed many questions of 



MUBs, some of which have now been answered. It is now 
known that for any dimension 3 < n < d + 1, where the 
upper bound can be achieved, by construction, for any 
dimension which is a power of a prime 

Here we will consider the case when d is prime and all 
probability tables for non prime dimensions can be built 
up from those for their prime factors, in much the same 
was as was done in section |IV.B| for the discrete phase 
spaces. 

Consider the generalized Pauli operator Z and its 
eigenbasis {0fe} and the projectors onto these vectors 
Pk '■= fyk^X- Define the finite Fourier transform 



F = 



1 

V2 



d-l 

£ 

fc,fc'=0 



(49) 



and the operator 



V = 



d-l 

£' 

k=0 



FP k F\ 



(50) 



Here, as before, division by two represents the multiplica- 
tive inverse of the element 2. For Hilbert space dimension 
d = 2, this operator requires the special definition 



V 



1 + i 1 - i 
1 + i 1-i 



(51) 



Now we can construct d + 1 MUBs via 



4>l 



V n fa, n = 1, 



We will denote the projectors onto these basis vectors 
P(n, k) := ipk^k* ■ Then the probability of obtaining the 
fcth outcome when measuring in the nth basis is 



fj,p(n, fc) = Tr(pP(n, fc)). 



(52) 



This can be view as a matrix in which the columns index 
the measurements while the rows index the outcomes. 
This can also be viewed as a mapping whose inverse is 
given by 



d d-l 



J2J2p P (n,k)P(n,k)-l. 



(53) 



71=0 fc=0 



1. Application: quantum mechanics without amplitudes 



The purpose of reference (Wootters 1986) was not to 



introduce a new representation of the quantum state per 



Wootters noted for d prime, a set of n = d+1 MUBs could 
be explicitly constructed via a prescription in reference 



12 For a recent review of the MUB problem see ( Combescure 



2006) 
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se, but to show that the whole of operational formalism 
of quantum mechanics can be done rather simply without 
complex numbers. 
Wootters notes first: 

It is obviously possible to devise a formula- 
tion of quantum mechanics without probabil- 
ity amplitudes. One is never forced to use 
any quantities in one's theory other than the 
raw results of measurements. However, there 
is no reason to expect such a formulation to 
be anything other than extremely ugly. 

To our surprise, the rule for transitioning between the 
probability tables turn out to be remarkably simple. In 
quantum theory the transition probability from state p 
to p' is the probability of preparing the state p, perform- 
ing the measurement {p' , 1 — p'} and obtaining p' . This 
transition probability is Pr(p — > p') = Tr^p'). If we 
work with the probability tables and call the tables p 
and p! , Wootters obtains 



d d-1 



Pr(/x ->• p!) = p(n, k)p'(n, k) - 1. 



(54) 



n=Q fc=0 



Unfortunately, as Wootters notes, it is not easy to ig- 
nore the density matrix altogether. We have yet to spec- 
ify which probability tables are valid and which do not 
correspond to quantum states. The simplest characteri- 
zation of valid probability tables is to say those for which 



equation ( 53 ) is a unit trace positive semi-definite matrix. 



This is unsatisfying as we would like a characterization 
independent of the density matrix. 



H. Hardy's vector representation 



Now the outcome of any quantum measurement can be 
assigned a positive operator E. Call this "outcome E" . 
Define the vector £e(oi) implicitly through 

E = Y,ZE(a)P(a). 

ot 

Then, the probability of "outcome E" is given by 
Pr( "outcome E") = gg(a)/jp(a), 



which, in vector notation, we can write as the dot product 

We define the sets M and S as the set of vectors ob- 
tainable through the mappings p i— > p and £h>^ defined 
above. More precise statements, in the form of inequali- 
ties, which make no recourse to the usual quantum me- 
chanical objects, can be made to define these sets. As- 
suming this has been done, we can rephrase the axioms of 
quantum mechanics, without mention of Hermitian oper- 
ators and the like, in this vector representation succinctly 
as follows: states are represent by vectors ~~jjt £ M; mea- 
surement outcomes are represented by vectors £ € 5; 
the probability of "outcome £ " in state ~pt is given by 
Pr( "outcome £")= £ • 7* • 



1. Application: quantum axiomatics 

As was the case in the previous section, this vector 
representation was not introduced as such. In references 
(Hardy 2001a|b ), Hardy has shown that five axioms are 
sufficient to imply the real vector formalism of quantum 
mechanics. The frequency interpretation of probability 
was given its own axiom. However, if we take our every- 



day intuitive notion of probability (Porta ManaL 2007), 



we no longer require this first axiom, which is indepen- 



In reference (Hardy, 2001a I Hardy showed that five dent of the rest (Schack 2003) 



axioms are sufficient to imply a special vector represen- 
tation which is equivalent to an operational form of quan- 
tum theory. We first describe the vector representation. 

Consider a basis for a d dimensional Hilbert space {<fik} 
(the eigenbasis of Z, say) and the following set of d 2 
projectors: 



b k (j>l if k = j 

Pkj ■■= { (fo + </>j)(<t>k + fa)* if k < j . 

*>k + i(f>j){4>k + i<t>j)* if k > 3 



(55) 



These projectors span the space of linear operators on 
the Hilbert space spanned by {4> k }- Now we vectorize by 
choosing an arbitrary but fixed ordering convention. For 
definiteness, we choose to stack the rows on top of one 
another. To this end, define a := dk+j and P(a) := Pkj- 
Then, the vector representation of the state p is given by 



We will make use of the following definitions: 

• The number of degrees of freedom, K, is defined 
as the minimum number of yes-no measurements 
whose outcome probabilities are needed to deter- 
mine the state (of belief in the mind of a reason- 
able agent), or, more roughly, as the number of real 
parameters required to specify the state. 

• The dimension, d, is defined as the maximum num- 
ber of states that can be reliably distinguished from 
one another in a single shot measurement. 

Axiom 1 defines probability as limiting frequencies and 
is not required (Schack 2003). The remainder of the 



p p (a) = Tv(pP(a)). 



(56) 



axioms are as follows: 

2 Simplicity. K is determined by a function of N 
(i.e. K — K(d)) where d = 1,2,... and where, for 
each given d, K takes the minimum value consistent 
with the axioms. 



17 



3 Subspaces. A system whose state is constrained to 
belong to an n dimensional subspace (i.e. have sup- 
port on only n of a set of d possible distinguishable 
states) behaves like a system of dimension n. 

4 Composite systems. A composite system consisting 
of subsystems A and B satisfies d — d A ds and 
K = K A K B . 

5 Continuity. There exists a continuous reversible 
transformation on a system between any two pure 
states of that system. 

These four axioms are sufficient for a derivation of the 
vector representation of quantum theory defined above. 
This axiomatization is also important for contrasting 
quantum theory with classical probability theory. As 
Hardy has shown, discrete classical probability theory (of 
dice, coins and so on) can be derived from only axioms 
2, 3 and 4. That is, the only difference between quan- 
tum and classical theory is the existence of a continuous 
reversible transformation between pure states. 



I. The real density matrix 



In reference ( Havel 2002 ) Havel defined the "real den- 



sity matrix" which, not surprisingly, is a particular real- 
valued matrix representation of the quantum state. 
For d — 2, define the 2x2 matrix of Pauli operators 

as 



P = 



1 X 
Y Z I " 



(57) 



For d — 2™, denote the bits in the binary expansion of 
k as 



fc = ]Tfc Q x2"-<\ 



and similarly for j. Then, the d x d matrix of Pauli 
operators is given by 



Pkj — -Pfciji 



(58) 



These d 2 operators are orthogonal, and hence form a ba- 
sis for the space of linear operators on the d dimensional 
Hilbcrt space. Therefore, each density matrix can be ex- 
pressed as 



^ d-l 



fc,i=0 



where the coefficients o~kj , explicitly given by 

a kj = Tr(pP kj ), (59) 
form the real density matrix. 



1. Application: NMR pedagogy 



Since the observables measured in NMR experiments 



are elements of the matrix of Pauli operators (58), the el 



cmcnts of the real density matrix are the experimentally 
measurable values. There is no need to reconstruct the 
density matrix. This is also a convenient fix to the prob- 
lem of reporting or visualizing a quantum state. Since 
the density matrix contains d 2 complex values, it is often 
graphically displayed as two d x d matrices of the real 
and imaginary parts. Not only is this redundant, it is 
conceptually awkward. On the other hand, the real den- 
sity matrix can be displayed as a single d x d matrix of 
real values. Havel offers the real density matrix as useful 
teaching device in such situations. 



J. Symmetric representations 



Consider the unitary group 

U M =u>"X'Z*, 



(60) 



where (p,q) eZ^x 7L d . In reference ( |Renes et al.\ 2004| ), 
the authors conjectural that the set {U( Piq )(j)} for some 
fiducial (f> £ % forms a symmetric informationally com- 
plete positive operator valued measure (SIC-POVM). The 
defining condition of a SIC-POVM is a set of d 2 vectors 
{(f>k} such that 



Skjd + 1 
d+1 



(61) 



The set is called symmetric since the vectors have equal 
overlap. The POVM is formed by taking the projectors 
onto the one-dimensional subspaces spanned by the vec- 
tors. It is informationally complete since these d 2 pro- 
jectors span the space of linear operators acting on %. 

As of writing, it is an open question whether SIC- 
POVMs exist in every dimension. Although numerical 



evidence suggests this to be the case (Scott and Grassl 



2010) 



For the remainder of this section we assume, for any 
dimension d, a SIC-POVM exists. Define the operators 
Pk '■= h^k^h' Then, define the symmetric-representation 



of a quantum state p as 



fi p (k) = Tv(pP k ). 



(62) 



This is a probability distribution and in particular it is 
the probability distribution for the POVM measurement 
formed by the effects {Pk}- As we have noted, this is an 



1S Apparently this was conjectured earlier by Zauner in a Ph.D. 
thesis not available in english. See reference ( Zauner 1999 ) and 
http: //www. imaph. tu-bs . de/qi/problems/23 .html 
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informationally complete measurement. Therefore, the 
density matrix can be reconstructed from the probabili- 
ties via 



= d(d+i) J2 PpWu-i. 



(63) 



fe=0 



When viewed as a mapping, this representation is a 
bijection from the convex set of density matrices to a 
convex subset of the <i 2 -dimensional probability simplex. 



1. Application: Quantum Bayesianism 



Quantum Bayesianism ( 


Caves et al. 




2002a 


Fuchs 


2002 


2010 Fuchs and Schack 


2009 Schack et al. 


2001 



is an interpretation of quantum theory which sheds new 
light on not only the tradition "foundational" prob- 
lems (the "measurement problem" , for example) but also 
many concepts in quantum theory, such as the "unknown 
quantum state" (Caves et al. 2002b). A key realization 



is the mathematical and conceptual sufficiency of view- 
ing quantum states as the probability distribution via the 
Born rule for a fixed POVM {E k }. The only remaining 
freedom is which one. 

One ideal is to have the POVM elements orthogonal: 



The statement that is not possible is 



equivalent to theorem [T] Next, then, we desire them to 
be as close to orthogonal as possible. Formally, we want 
to minimize the quantity 

F = ^(Tr(P k P j )-S kj ) 2 . 

kj 

This expression is minimized if and only if the {Pk} form 
a SIC-POVM. Using the reconstruction formula in equa- 
tion (63 1 it can be shown that, in terms of this SIC- 



representation, the Born rule for a measurement {Ej} 
given state fi(k) is 



Pr(outcome j) = I /x(fc) 



am, 



(64) 



where £(j\k) = Ti(EjP k ). In the same sense as the SIC- 
POVM being as close to orthogonal as possible, equation 



(64) is as close as possible to the classical Law of Total 



Probability. Effort is being made to use equation ( 64 ) as 
a starting point for a natural set of axioms which would 
single out quantum theory. 



Considered here is a generalization of a basis for M(T~L), 
which is the set of Hermitian operators on an complex 
Hilbert space of dimension d. With the trace inner prod- 
uct (A, B) := Tr(AB), U(H) forms a real Hilbert space 
itself of dimension d 2 . Let A be some set of cardinality 
d 2 < |A| < co. 

A fram^l for M(H) is a set of operators J- := 
{F(X)} C W(H) which satisfies 



(65) 



AeA 



for all A E M(H) and some constants a, b > 0. This defi- 
nition generalizes a defining condition for an orthogonal 
basis {B k }f =1 



J2nBkA] 2 = \\A\\ 



(66) 



k=l 



for all A e W(H). The mapping A H> Tr[F(X)A] is called 
a frame representation of M.(H). 

A frame T> := {D(X}} which satisfies 



A=^Tr[F(AL4]Z>(A), 
AeA 



(67) 



for all A € H(7{), is a dual frame (to F). The frame 
operator associated with the frame T is defined as 



S(A) : =^Tr[F(AL4]F(A). 



AeA 



If the frame operator is proportional to the identity su- 
peroperator, S = al, the frame is called tight. The frame 
operator is invertible and thus every operator has a rep- 
resentation 



A = S^SA = J2 Tr[F(X)A]S- 1 F(\). 
AeA 



(68) 



The frame S~ 1 J- is called the canonical dual frame. 
When |A| = d 2 , the canonical dual frame is the unique 
dual, otherwise there are infinitely many choices for a 
dual frame. A tight frame is ideal from the perspective 
that its canonical dual is proportional to the frame itself. 
Hence, the reconstruction is given by the convenient for- 
mula 

A = S^SA = - V Tt[F(\)A]F(X) 
a ^-^ 

AeA 

which is to be compared with 



V. UNIFICATION OF THE QUASI-PROBABILITY 
REPRESENTATIONS VIA FRAMES 

A. Introduction to frames 



A = J2Tr[B k A]B k 



k=l 



which defines {B k } k=1 as an orthonormal basis. 



A frame can be thought of as a generalization of an or- 
thonormal basis (Christensen 2003). However, the par- 



ticular Hilbert space under consideration here is not H. 



14 Frames have been considered in the context of quantum theory 
for other purposes in i jBisio et qZT] |2009| |Scott| |2006| l . 
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B. Unification via the necessity and sufficiency of frames 



C. To infinity and beyond 



Recalling the formal definition Q of a quasi- 
probability representation, we have the following theo- 
rem 

Theorem 3. Two functions fi and £ constitute a quasi- 
probability representation if and only if 

Mp (A) = Ti[pF(X)} 
£ B (A)=Tr[FD(A)], 

where {F(X}} is a frame and {D(X}} is one if its duals. 



This was proven in reference (Ferrie and Emerson 



2009 This theorem allows us to make the follow- 
ing statement which is equivalent to the no-classical- 
representation theorem ([TJ and negativity theorem 
there does not exists two frames of positive operators 
which are dual to each other. 

With this results we can create quasi-probability rep- 
resentations of the whole operational formalism of quan- 
tum theory, not just states. First, we chose one of the 
discrete quasi-probability functions described in section 
|IV| Second, we identify the frame which gives rise to it. 
Lastly we compute its dual frame to obtain the part of 
the quasi-probability representation mapping, £, which 
takes measurements to functions on the space A. 

Suppose instead the functions /i and £ are defined via 

M ,(A) = Tt[ P F(X)} 
£e(A) = r Tr[EF(X)], 

where {F(X}} is a frame. Then, 

(a) Mp (A)G [0,1] and£ A MA) = l, 

(b) £ E (\)e [0,1] and Ci(A) = l, 

(c) Tr(pF) = Ea,vMp(A)^(A')TV[ j D(A) j D(A')]. 



In reference (Ferrie and Emerson 2008) this represen- 



tation was called a deformed probability representation 
since states and measurements are represented as true 
probabilities but the law of total probability is deformed. 

The frame formalism also provides a convenient trans- 
formation matrix to map between representations. We 
have 

Mp (A) = Tr[pF(A)] 

= £Tr[pF'(A')]Tr[iy(A')F(A)] 



where the matrix T\>\ is the symmetric matrix which 
takes the /i representation to p! representation. 



Given a quasi-probability representation note that the 
frame satisfies 



5>(A) = 1. 



AgA 

Thus, if the quasi-probability representation satisfies < 
A*(A) < 1, the frame is an informationally complete pos- 
itive operator valued measure (IC-POVM^J Similarly, 
the dual frame satisfies 

Tr[D(\)] = 1, 

for all A e A. Thus, if the the quasi-probability represen- 
tation satisfies < £(A) < 1, the dual frame is a set of 
density operators. The definitions and results we have so 
far considered are tailored to the case d < oo - that is, 
finite dimensional quantum theory. Now we will extend 



them to infinite dimensions as done in reference (Ferrie 



et al. 2010). 



We allow for now the dimension of the Hilbert space 
% to be arbitrary and let (f2, S) be a measurable space, 
where S is a cr-algebra. Over this space, M«.(Q, de- 
notes the bounded signed measures while J-r(S1, S) de- 
notes the bounded measurable functions. A signed mea- 
sure generalizes the usual notion of measure to allow for 
negative values. The classical states are the probability 
measures 5(f2,S) C A^R(fi,S). We define the gener- 
alization of frame to the space of trace-class operators 
T a (H). 

We generalize the definition of informationally com- 
plete observable for infinite dimensions and define an op- 
erator valued measure as a map F : £ — > B S (H) satisfying 
F(0) = 0, F(Q) = 1 and 



\i=l 



i=l 



where the sets Bi € S are mutually disjoint and the 
sum converges in the weak sense. A frame for 71(1-1) is 
an operator valued measure F for which the map T : 
T S {U) -> -M R (fi,E), 

T(W)(B) := Tr(WF(B)), 

is injective. The map T is called a frame representation 
ofT s (H). 

Similarly, generalizing the reconstruction formula 
defining the dual frame for finite dimensions yields the 
generalized notion of a dual. That is, given a frame F, 



16 Frames have also been used in definition of informationally com- 
15 Compare this to a similar result in a more operational setting in pleteness in the context of tomography in reference jBisio et al.\ 



references | |Porta Mana| |2004a|b[ ). |2009| . 
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a dual frame to F is a map D : SI —> B S (H)*, the dual 
space, for which the function 

(SA)(u>) := (D(u))(A) 

is measurable and satisfies 



A= / SAdF, 



(69) 



for all A G B S (W). 

On the other hand, we define a quasi-probability rep- 
resentation of quantum mechanics is a pair of mappings 
T : S(H) -> M K (fi,£) and 5 : -> Jk(n,E) such 

that 

1. T and 5 are affine, T is bounded. 

2. Tp(O) = 1. 

3. 5(0) = 0. 

4. For all p G 5(W) and B G £(%), 



Tr(pE) = / (SE)d(Tp). 
Jn 



(70) 



In reference (Ferrie et al. 2010) it was shown that for 



these generalized definitions frame representations and 
quasi-probability representations remain equivalent and 
the following "negativity theorem" was proven: 

Theorem 4. A quasi-probability representation of quan- 
tum mechanics must have, for some p G S(T-L), E G £(fi) 
either [Tp){B) < for some B G 3 or (SE)(cj) g [0, 1] 
/or some cj E SI. 

Being more general, this theorem implies the previous 
three equivalent "negativity theorems" and says essen- 
tially the same; a classical representation does not exist 
and within a quasi-probability representation negativity 
must appear in the representation of the states or mea- 
surements or both. 



VI. NEGATIVITY IN QUASI-PROBABILITY 
REPRESENTATION 

There have been a variety of approaches to the problem 
of characterizing what is non-classical about quantum 
theory. In the previous section one such notion of non- 
classicality was considered: the requirement of "negative 
probability" , or simply negativity. However, the negativ- 
ity theorem leaves open the question of the interpretation 
of negativity in any particular representation. In section 
| VI. A| below we discuss the ways in which negativity can 
be applied as a criterion for quantumness with respect 
to particular choices of representation. In section |VI.B 
the traditional ideas of contextuality and nonlocality in 
relation to negativity [^j 



A. Negativity as an indicator of quantumness 

There is a strong tradition in physics of considering 
negativity of the Wigner function as an indicator of non- 
classical features of quantum states. The non-classical 
features attributed to negativity of the original Wigner 



function include quantum nonlocality (Bell 2004 Kalev 



et al. 2009), quantum chaos (Paz et al. 



turn coherence (Paz et al. 1993). In quantum optics, 



1993) and quan- 



however, tradition has been to use the Q- and P-functions 



(section III.B) to define quantumness (Mandel 1986). 
Recall equation (17), where the P-function of p was de- 



fined implicitly through 

p = J d 2 aP{a)\a)(a\. 

If P has the properties of a probability distribution, then 
the state is a mixture of coherent states. Coherent states 
are minimum uncertainty states and this fact is often 
cited when it is stated that such a state is "the most 
classical" of the quantum states of light. More specifi- 
cally, if P is a probability distribution then the quantum 
field cannot display genuine quantum optical effects and 
can be simulated by a stochastic classical electromagnetic 
field (iWalls and Milburn 1995). Technically, however, P 



is not a function but a distribution which can be highly 
singular. Thus P functions which are not classical distri- 
butions arc difficult to experimentally prepare and verify; 



although, recent progress has been made (Kiesel et al. 



2008). 



Effort has been extended beyond qualitatively defin- 
ing negativity as quantumness to quantifying quantum- 
ness via negativity. In terms of the Wigner function, the 
volume of the negative parts of the represented quan- 
tum state has been s uggested as the appropriate mea - 
sure of quantumness (Kenfack and Zyczkowski 2004). 



The distance (in some some preferred norm on M(fH)) to 
the convex subset of positive Wigner functions was sug- 



gested to quantify quantumness in reference ( Mari et al. 



2010). This was also done in references (Giraud et al. 



2008 



2010 1 for a finite analogs of the P- and Q-functions 



rather than the Wigner function. 

The main difficulty with interpreting negativity in a 
particular quasi-probability representation as a criterion 
for or definition of quantumness is the non-uniqueness of 
that particular quasi-probability representation. We can 
always find a new representation in which any given state 
admits a non-negative quasi-probability representation. 
Recall, in fact, that in some representations all states 
are non-negative. Thus, negativity of some state p in 
one particular arbitrary representation is a meaningless 
notion of quantumness per se. 



17 These are diverse and rich fields of study in their own right. A 
starting point for the interested reader on contextuality is ref- 



erenc e (|Held| |2008| and quantum nonlocality is reference | |Red-| 
head|[l989p 
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An alternative approach to establishing a connection 
between quantumness and negativity is to start by as- 
suming some criterion for quantumness and then find- 
ing a choice of representation in which this criterion is 
expressed via negativity. This approach has been ap- 
plied in the context of multi-partite systems for which 
entanglement is presumed to provide a criterion for quan- 
tumness. Entanglement is a kind of correlation between 
two quantum systems which cannot be achieved for clas- 
sical variables and is one of the central ingredients in 
quantum information theory F^] Recall that a density 
matrix p is entangled if it cannot be written as a con- 
vex combination of the form p = X^fcPfePfe ® P)f' \ f° r 
all k, where and pjp are states on the individual 
subsystems. Consider a product-state frame constructed 
out of frames for two subsystems. That is, consider the 
frame {F^(A) <g) F^(X')}, where each {F^(X}} is a 
set of density matrices composing a frame. Then, if wc 
represent a quantum state using the dual frame, we a 
have a quasi-probability representation in which states 
with negativity are entangled. Explicit constructions of 
such quasi-probability representations were developed by 
Schack and Caves (Schack and Caves 2000). An opti- 



mization procedure to find the representation with the 
minimum amount of negativity was given in reference 



(Sperling and Vogel 2009) 



An obvious limitation with the above approach is that 
entanglement cannot capture any notion of quantumness 
for single quantum systems. A second, more subtle, is- 
sue is that identification of entanglement as "the" crucial 
non-classical resources is problematic in certain branches 
of quantum information science. The most striking ex- 
ample questioning the role of entanglement in quan- 
tum information theory is DQC1 (deterministic quan- 
tum computing with one clean qubit). DQC1 (|Knill and 



Laflamme 1998 1 is a model of computation which refers 
to any algorithm which satisfies the following (or a mod- 
ification not requiring exponentially more resources j^j 

1. its input consists of a single pure state in the first 
control register and the remaining n registers are 
in the maximally mixed state p = 2 _n l; 

2. the input state is subjected to a unitary U n con- 
trolled by the state of the first register; 

3. the output is a statistical estimate of 2 -n Tr(Z7 n ) 
(achieved by measuring the average of control bit 
in the Z basis). 



2009) 



For a recent review of entanglement, see ( Horodecki et al. 
This model of computation has served as the basis tor various 
definitions of complexity classes, also called DQC1. There are 
many open questions in this line of research and the interested 
reader should consult refere nces (|Ambainis et al.\ |2006| |Shep-| 
herd|[2006l|Shor and Jordan] |2008[ ).~ 



DQC1 appears to be a non-trivial computational model 
which has been shown to have exponential advantages 
over (known) classical algorithms in the the follow areas: 



simulation of quantum systems (Knill and Laflamme 



1998), quadratically signed weight enumerators (Knill 



and Laflamme 2001), evaluating the local density of 
states (Emerson et al. 20041, estimating the average fi- 



delity decay under quantum maps (Poulin et al. 2004) 



and estimating the value of Jones polynomials ( Shor and 



Jordan] [2008 1. 

In the DQC1 model, the bipartite split between the 
control qubit and the rest contains no entanglement and 



in reference ( Datta et al. 2005 ) it was shown that there 



is a vanishingly small amount of entanglement across any 
other bipartite splitting. This suggests it is unlikely that 
entanglement is responsible for the speed-up provided by 



DQC1 (Datta 2008). Conceptually, computation is a lo- 
cal task with complex dynamics and may not require the 
non-local, Bell-inequality- violating correlations of entan- 
glement (Laflamme et al. 2002). A sentiment issued in 



reference ( Laflamme et al. 2002 ) and reiterated recently 



by Vedral (Vedral 2010) is that no one single criteria 
can capture quantumness and perhaps even the resources 
necessary for the quantum advantage must be studied on 
a case-by-case basis. 

An important consideration for all of the above ap- 
proaches is that the notion of a quantum state, con- 
sidered in isolation, is operationally meaningless. Com- 
parison with experiment always requires specifying both 
a state (a preparation procedure) and a measurement. 
Consider two experiments, one which prepares a product 
state and measures the state by projecting onto an en- 
tangled basis, and a second which prepares an entangled 
state and measures that state in a product basis. Both 
experiments produce the same statistical predictions, but 
only the second is considered non-classical when consid- 
ering the state in isolation. As emphasized in references 



Ferrie and Emerson 


2008 


Spekkens 


2008) 



come this obvious deficiency if we consider the whole op- 
erational set-up - states and measurements. In this way, 
the existence of a positive quasi-probability representa- 
tion implies the existence of a non-contextual ontological 
model and vice versa. 



The formalism of references ( Ferrie and Emerson 2008 



2009 Ferrie et al. 



2010 ) shows the necessity of negativ- 
ity when considering a representation of the full quantum 
formalism. That is, the negativity theorem (theorem [2]) 
applies to quasi-probability representations of quantum 
theory as a whole. However, the negativity theorem may 
not apply if we consider a specific experiment, device, 
or protocol which may not faithfully reproduce the full 
power of quantum theory. This work suggests the follow- 
ing promising approach: define a classical representation 
of an experiment as the existence of a frame and dual for 
which the convex hull of the experimentally accessible 
states and measurements have positive representation. 
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Then, we can conclude that negativity, taken to mean 
the absence of any representation satisfying the above 
conditions, corresponds to quantumness. 

The above criterion for classicality was considered in 



B. Traditional contextuality and nonlocality 



The traditional definition of contextuality evolved from 
a theorem which appears in a paper by Kochen and 
reference (iSchack and Cavesl 119991) to question the quan- Specker (Kochen and Specker 1967). The Kochen- 



tum nature of proposed NMR quantum computers. How- 
ever, as noted there, the immediate objection is the fol- 
lowing: the states and measurements can be represented 
by classical probabilities while the transformation be- 
tween them may not be represented by classical stochas- 
tic maps. That is, a truly classical model must represent 
each applied transformation in a experiment as classi- 



cal stochastic mapping. In reference (Schack and Caves 



1999), such stochastic maps were identified for the set of 
NMR experiments reported at the time P^l 

The scope of quantum theory that has been consider 
thus far can be thought of as kinematical; only the de- 
scription of experimental configurations is of concern. 
The traditional approach to quantum theory (quantum 
mechanics) focuses on how and why quantum systems 
change in time. Using the Wigner function formalism 
to describe the dynamical transformations predicted by 
quantum mechanics yields the dynamical law 



djip 
dt 



1 



^ 2 2n (2n + 1)! dq 



2n+l 



dp 



2n+l 



(71) 

where /i p is the Wigner function and H is the classical 
Hamiltonian and {H, /i p } is the classical Poisson bracket. 
Notice then that Equation ( 71 ) is of the form "classical 
evolution" + "quantum correction terms" . Using this 
formalism, one can then do more than discuss which ex- 
perimental procedures are classical. Now one can discuss 
the transitions between quantum and classical descrip- 



tions, a process known as decoherence (Habib et al. 1998 



Joos 


2003 


Paz et al. 


1993) 



The representation of the 
dynamics was also studied for the spherical phase space 



in (Kalmykov et al. 2008 Zueco and Calvo 20071. The 



goal is to compare the representation of the quantum dy- 
namics (be it the Schrodinger equation or the more gen- 
eral master equation) to the natural classical dynamics of 
the representation's phase space. The challenge for finite 
dimensional systems is that no natural notion of discrete 
phase space exists for classical system. This problem 



has been recently studied by Livine (Livine 20101 by in- 
troducing a discrete differential calculus for the discrete 
phase spaces of Wootters. However, beyond these few 
examples, transformations and dynamics have not been 
studied anywhere near to the extent that states have for 
quasi-probability representations and presents itself as a 
open problem. 



20 Note, however, that the reasonable requirement of an efficient 
classical model was not met. 



Specker theorem concerns the standard quantum formal- 
ism: physical systems are assigned states in a complex 
Hilbert space % and measurements are made of observ- 
ables represented by Hermitian operators. The theorem 
establishes a contradiction between a set of plausible as- 
sumptions which together imply that quantum systems 
possess a consistent set of pre-mcasurcmcnt values for 
observable quantities. Let H be the Hilbert space associ- 
ated with a quantum system and A e H('H) be the opera- 
tor associated with some observable. The function ftp(A) 
represents the value of the observable when the system is 
in state ip. One assumption used to derive the contradic- 
tion is that for any function F, f^(F(A)) = F(f^(A)). 
This is plausible because, for example, we would expect 
that the value of A 2 could be obtained in this way from 
the value of A. 

The conclusion of Kochen-Specker theorem implies the 
following counterintuitive example (Isham 1995[ ). Sup- 
pose three operators A, B, and C satisfy [A,B] = = 
[A, C], but [B, C] ^ 0. Then, the value of the observable 
A will depend on whether observable B or C is chosen 
to be measured as well. That is, assuming that physi- 
cal systems do possess values which can be revealed via 
measurements, the value of A depends on the context of 
the measurement. 

What the Kochen-Specker theorem establishes then is 
the mathematical framework of quantum theory does not 
allow for a non- contextual model for pre-measurement 
values. This fact is often expressed via the phrase "quan- 
tum theory is contextual" . 

The original notion of contextuality in lacking in the 
sense that it only applies to the standard formalism of 
quantum theory and does not apply to general opera- 
tional models. This problem was addressed by Spekkens 
as discussed in section |TT] above. The notion of contex- 
tuality defined by Spekkens is more general; one can re- 
cover the original assumptions of Kochen-Specker by as- 
suming that the projector valued measures in the spec- 
tral resolutions of obscrvables are represented by disper- 
sion free (0-1 valued) conditional probabilities (these are 
also called sharp indicator functions)^^ Since the set of 
fewer assumptions already contains a contradiction when 
taken in conjunction, the addition of the assumption of 



21 Cabello has also generalized the notion of contextuality to 
POVMs | |Cabello| [2003}. A gain, however, the additional as- 
sumption of dispersion free condition probabilities is used. See 
( |Grudka an d Kurzyriski 20081 for an elaboration on this point. 
For a more broad discussion on contextuality see | |Morris| 2009 ) 
and |Harrigan and Rudolph||2007| |. 
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Kochen-Specker is unnecessary. Thus, we need only con- 
sider the more general notion of contextuality we have 
already defined. This more general notion of contextual- 
ity has also recently been subject to experimental tests 



property of any theory that quantum mechanics was con- 
cluded to be incomplete. That is, there must be elements 
of physical reality (hidden variables) which quantum me- 
chanics does not account for. 

The argument of EPR was reformulated by Bohm 
A hidden variable theory originally formulated by de (Bohm 1989) for two qubits. The argument is built 



(Spekkens et at 2009). 



Broglic and later by Bohm (Bohm 1989) is perhaps the 



most famous example of an ontological model of quantum 
theory. The model assumes that for a given experimen- 
tal configuration, there exists a particle with well defined 
trajectory and a quantum state ip. The hidden variable 
is the position of the particle in real space. That is, the 
classical state space is A = M 3 x %. The Hilbert space is 
included in the state space as its serves as a wave which 
guides the particle. If at any time the particle is dis- 
tributed according to quantum probability distribution 
|0| 2 , it remains so. Thus, so long as it is assumed that 
the particle is prepared according to this distribution, 
the model provides the same predictions as the standard 
formulation of quantum theory. 

Note that this model does not fit into the framework 
of quasi-probability representations. Exactly as it was 
for the Beltrametti-Bugajski model, the de Broglie-Bohm 
model does not consider the entire range of possible quan- 
tum states. Where a classical representation contains a 
convex-linear mapping p i-> p(X) for all p 6 0(H), the 
dc Broglie-Bohm model considers only a mapping with 



domain H. Bell notes that ( Bell 2004) "in the de Broglie- 



Bohm theory a fundamental significance is given to the 
wavefunction, and it cannot be transferred to the density 
matrix." 

Bell does not claim that the situation is such that the 
de Broglie-Bohm model cannot be extended to include 
density operators. The key words in his comment are 
"fundament significance". Indeed, the de Broglie-Bohm 
model can be extended to include density operators pro- 
vided this extension is cither contextual or contains neg- 
ativity. In either case, the pure states (wavefunctions) re- 
tain their significance while the density operators possess 
non-classical features. As an example, the de Broglie- 
Bohm model could be such that (p, c-p) H> p Cv (A) where 
each preparation consists of a density operator p supplied 
with a context c-p which specifies a particular convex de- 
composition of p into pure states. Such a model would 
be preparation contextual. 

The non-locality debate was initiated by a paper by 



Einstein, Podolsky and Rosen (EPR) (Einstein et al. 



1935 ) where it was argued that quantum mechanics is in- 
complete (each element of physical reality does not have 
a counterpart in quantum theory) if special relativity re- 
mains valid. The latter means physical causation must 
be local or events cannot have causes outside of their 
past light cones. Using a particular spatially separated 
quantum system, and some standard quantum theory, 
EPR concluded that quantum mechanics is either incom- 
plete or nonlocal (or both!). Locality was such a desired 



around the following hypothetical experiment. Two par- 
ties, Alice and Bob, are at distant locations with a source 
midway between them creating quantum systems de- 
scribed by the quantum state 



1> 



1 



V2 



(01 ® 02 - 02 ® 0l), 



(72) 



where {<f>i, 02} is an orthonormal basis for a qubit. One 
particle is sent to Alice and the other to Bob. Alice per- 
forms the projective two-outcome measurement {P-i^P?} 
on the particle which was sent to her. The state in equa- 



tion ( 72 ) is such that Alice, once she performs her mea- 



surement, she can predict with certainty the outcome 
Bob receives when he performs the same measurement 
at his side of the experiment regardless of whether or 
not the measurement events are spacelike separated (i.e. 
nonlocal). For example, Alice could perform the mea- 
surement {0!0J,0202}- According to the collapse pos- 
tulate, if Alice registers the first outcome, Bob particle 
will immediately collapse to 02 and he is certain to ob- 
tain the second outcome if he were to make the same 
measurement. Therefore, unless there exists hidden vari- 
ables which pre-determine the possible outcomes when 
the particles are created, quantum theory is nonlocal. 
Bell later investigated the possibility of finding the hid- 



den variables Einstein thought to exist (Bell 2004). He 



noted immediately that the de Broglie-Bohm theory was 
such a theory yet in contained an astonishingly nonlocal 
character. He was able to prove that any hidden variable 
theory of quantum phenomena must possess nonlocal fea- 
tures. This is now called Bell's theorem. 

The proof is by contraction and follows the general line 
of reasoning which lead to the negativity theorem: build a 
mathematical model with assumptions that can be iden- 
tified with (or motivated by) some notion of classicality 
then prove that quantum theory does not satisfy these as- 
sumptions. Consider the EPR experimental setup. Alice 
and Bob can each perform a two-outcome measurement 
with outcomes labeled A and B, respectively. Without 
loss of generality, the outcomes can be assigned numerical 
values A, B = ±1. 

Suppose there exist a classical state space A (i.e. a 
set of hidden variables or, as we have called it above, an 
ontology) which serves to determine the outcomes A and 
B. Probabilistic knowledge of the state is represented by 
a density p,(\) > which is normalized 



d\p(X) = 1. 



The different measurements Alice and Bob can perform 
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are parameterized by detector settings a and b, respec- 
tively. Locality is enforced by assuming that the out- 
comes A and B depend only the local detector settings 
and the global state. That is A — A(a, A) is allowed but 
A = A(a, b, A) is not. Define the correlation function 



C(a,b) 



dXA(a,X)B(b, A)//(A). 



(73) 



Bell's theorem states that the correlations obtained in 
the EPR experiment (i.e. a particular quantum experi- 
ment) cannot satisfy this equation. The proof follows by 
deriving an inequality from equation ( 73 ) such as 



\C(a,b)-C(a,c)\ < 1 + C(b,c). 



(74) 



This inequality holds for any hidden variable model 
which satisfies the locality assumption. For the quantum 
state in equation (72 1, the inequality is violated. This 



is the contradiction between the quantum theory and a 
local hidden variable model which proves Bell's theorem. 

It was noted that the assumptions which go into the 
hidden variable models first considered by Bell imply 
those models are deterministic. That is, the theorem did 
not exclude models which suggested quantum theory only 
provides stochastic (or probabilistic) information of the 
possible outcomes of measurements. Bell later extended 
the theorem to include such models. For the EPR exper- 
imental setup, let the conditional probability of outcome 
A = 1, for Alice, given the state (hidden variable) is 
A 6 A be denoted Ma (A) and similarly define Ms (A) 
for Bob. Now denote the conditional joint probability of 
the simultaneous outcomes A, B — 1 by Mas (A). Fine 



(Fine 1982) defines a stochastic hidden variable model as 



one which satisfies 



and 



Vv{A = 1) 



Pr(A = 1,B = 1) 



dXM A (X)n(X) 



(75) 



dXM AB (X)v{X). (76) 



If Mab(A) = Ma(A)Mb(A), then the model is factoriz- 
able. Bell claimed this also encoded the assumption of 
locality. Again, it can be shown that quantum theory 
is in contradiction with an inequality derived from these 
assumptions. Fine showed that a factorizable stochastic 
hidden variable model exists for the EPR-type correlation 
experiment if and only if a deterministic hidden variable 
model exists for the experiment. Since the latter is ruled 
out, the former is also ruled out. 

It is often stated that the consequence of Bell's theorem 
is "quantum theory is non-local" . However the theorem 
only states that quantum theory does not satisfy the as- 
sumptions which go into a classical model Bell defines 
as local. It is not necessary that the locality assump- 
tion is violated. Nor is it unanimously agreed that the 



mathematical condition Bell refers to as locality in the 
stochastic hidden variable models reflects any physical 
significance (Fine 1982 1. Next, we will see how such a 
claim can be supported by appealing to the notions of 
negativity. 

Notice that a (non-factorizable) stochastic hidden vari- 
able model is exactly a classical representation. Then fac- 
torizability (Bell locality) is an additional requirement. 
However, the negativity theorem rules out a classical rep- 
resentation independently of any assumption of locality 
for the most general quantum experiment. The nega- 
tivity theorem implies that one (or more) of the con- 
straints which go into the definition of a classical rep- 
resentation are false. Indeed, relaxing the assumptions 
of positive probability yields a quasi-probability repre- 
sentation which is not in conflict with quantum theory. 
The existence of positive probability distributions in the 
hidden variables Bell attempts to rule out is often consid- 
ered an unquestionable assumption. However, if it is the 
case that negative probability encodes something about 
Nature that is independent of locality, then it is not nec- 
essarily the case that Bells theorem implies nonlocality 
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VII. SUMMARY 

In this article, we have seen how quasi-probability rep- 
resentations arise naturally out of the search for an on- 
tological model of quantum theory. A non-contextual 
ontological model implies the existence of a classical 
representation, which was shown to be in contradiction 
with the quantum framework. However, by relaxing 
the requirement of positive probabilities we have seen 
that many gwasi-probability representations are consis- 
tent with quantum theory, and indeed useful for quantum 
information theory. 

We have shown how the theory of frames unifies the 
quasi-probability representations into a single formalism. 
This frame formalism allows two canonical methods for 
incorporating measurements into the framework. In the 
first, we use a dual frame to represent measurements. 
This gives rise to framework respecting the usual dot 
product between the represented functions (which is the 
classical rule of total probability) and is equivalent to the 
formal definition of a quasi-probability representation. In 
the second method, the same frame is used to represent 
states and measurements. This gives rise to the deformed 
probability representation. Here, negativity need not ap- 
pear in the representations of states and measurements 
but the Born rule is replaced by a deformation of the 
classical rule of total probability. 

Finally, we have noted that there is significant interest 
surrounding the presence and interpretation of negativity 
in the quasi-probability representations. In particular, 
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the use of negative probability allows us to avoid the con- 
tradictions which had lead to conclusions that quantum 
theory is contextual (via the Kochen-Specker theorem) 
and nonlocal (via Bell's theorem). 

We have discussed how negativity of states alone in 
some particular representation is a meaningless criterion 
for quantumness per se. Alternatively, we can accept 
some well defined notion of a truly quantum state (such 
as an entangled state) and show that negativity is nec- 
essary for such state in some representation. If we are 
ultimately to make use of our notion of quantumness, 
however, it needs to be defined for an operational task 
and hence incorporate the full operational formalism of 
quantum theory. That is, it must incorporate measure- 
ments (which we have done here) and dynamical transfor- 
mation (which wc have identified as a largely unexplored 
area of research). 
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Appendix A: Notations and conventions 

Quantum theory makes use of complex Hilbert spaces. 
If these spaces are finite dimensional, then they are equiv- 
alent to inner product spaces. Unless otherwise noted, 
a Hilbert space % will be assumed to have dimension 
d < oo and (for ip,(j) £ %) its inner product will be de- 
noted (ip,(j>). The following list defines some specials sets 
of linear operators acting on %. 

1. An operator U satisfying 

tfU = UU^ = 1, 

is called unitary. The set of all unitary operators 
is denoted U. 

2. An operator A satisfying 

(A^,<f>) = (if> t M), 

for all ip, (/> £ %, is called Hermitian. The set of all 
Hermitian operators is denoted M(H). 

3. An operator E satisfying 

OKfaEty) < 1, 

for all V ^ % i s called an effect. The set of all 
effects is denoted E(H). 



4. An effect p satisfying 

Tr(p) = 1 

is called a density operator. The set of all density 
operators is denoted B("H). 

5. A set of effects {Ek] which satisfy 

is called a POVM (positive operator valued mea- 
sure). The set of all POVMs is denoted POVM(^). 

6. An effect P satisfying 

P 2 = P 

is called a projector. The set of all projectors is 
denoted ¥(H). 

7. A set of projectors {Pk}, each of rank 1, which 
satisfy 

k 

is called a PVM (projector valued measure). The 
set of all PVMs is denoted PVM(H). 

Note that from these definitions we have F(H) C 
3(H) C E(H) C U(H) and PVM(H) C POVM(H). The 
set M(H) defines its own Hilbert space with inner prod- 
uct (for A,B £ M(H)) (A, B) := Tr(AB). The dimension 
of H(H) is d 2 . 

The following are some examples of important op- 
erators used in this paper. Consider the operator Z 
whose spectrum is spec(Z) = {u k : k £ %d}, where 
uj k = er Zj s kk . The eigenvectors form a basis for H and 
are denoted {4>k}- Consider also the operator defined by 
Xcf>k = 4>k+~Li where all arithmetic is modulo d. Define 
Y implicitly through [X,Z] = 2iY. The operators Z,X 
and Y are often called generalized Pauli operators since 
they are indeed the usual Pauli operators when d = 2. 
The parity operator is defined by P<f>k — 4>-k- 
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